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PREFACE 


Communication could be a window to the human mind and behaviour. As a consequence, the 
true rendition of the cognitive and the computational natures of the human mind depend on the 
exploration of various aspects of communication. Since diversity in India is higher in terms of 
communicative means and methods, it becomes quintessential to understand the way cognitive 
and computational aspects of the human mind and behaviour unfolds themselves through 
communications. More specifically, IC3C will focus on the cognitive and communicative 
aspects of verbal and non-verbal communication not only in the case of humans but also in the 


case of non-human communication from different perspectives. 


This conference proceeding, remembering this theme, contains papers chose for presentations 
in the technical sessions of IC3C 2021. We are grateful to our amazing group of reviewers from 
everywhere the country who merit full recognition for the difficult work of reviewing the great 
entries with arrive at specialized substance. We are profoundly thankful to Dr. Bruce Balentine 
Chief Scientist, Enterprise Integration Group, USA, Dr. Karin Verspoor, Director, Health 
Technologies, Melbourne School of Engineering, Australia, Dr. Samudra Vijaya K Visiting 
Faculty NMIMS, Mumbai Dr. Monojit Choudhury, Principal Researcher, Microsoft Research 
Lab, India for giving the keynote addresses at IC3C. We might likewise want to thank the 
members of coordinating committee, organizing committee and advisory committee for their 


support and co-operation in making IC3C 2021 a triumph. 


Soumya Sankar Ghosh 
Samir Karmakar 
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About VIT Bhopal University 


VIT is known for inspiring young aspirants by creating new trends around the globe. VIT has 
established collaboration with reputed organizations and strategic partnerships with 
universities around the world to prepare a globally young competent generation of 
professionals. VIT Bhopal University is the first and only private university in India to have 
100% doctoral faculty. This university is also known for its very own CALTech learning 
process. 

The VIT legacy, which started from Vellore Engineering College 35 years ago to four 
Universities in India, today, is a result of constant efforts to impart high quality of education. 
The Govt. of India has recognized VIT as No. | Private Institution for Innovation, ARITA 2019, 
and the recent recognition, as an Institution of Eminence, has paved a way to focus more on 
research with international collaborations to move up in the global ranking and contribute in 


the capacity building of our nation. 


About Jadavpur University 


Jadavpur University is at present an internationally recognised premier university of the 
country. It is an urban university with a global perspective and is dedicated to creating leaders 
who will fashion a more humane and just world. Following in the footsteps of its predecessor, 
the University not only engages itself in teaching and research of international quality, but also 
provides societal services for the benefit of rural and urban populations. The programmes are 
directed towards achieving excellence in education and extending the benefits of improved 


technology to the society as a whole. 
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Dr. G. VISWANATHAN 

Founder & Chancellor 

Former Member of Parliament 

Former Minister, Govt. of Tamil Nadu 

President, Education Promotion Society for India, New Delhi 


MESSAGE 


I am extremely happy to note that the School of Advanced Sciences and Languages (SASL) 
and the School of Computing Sciences and Engineering (SCSE) are organizing the First 
Chapter of 7he International Conference on Communication Computation and 
Cognition (IC3C 2021) in collaboration with the School of Languages and Linguistics, 
Jadavpur University, Kolkata, West Bengal from January 29th to January 31st, 2021. 
The cornerstone of VIT Bhopal University is embedded in research and innovation as the 
members are ardently engaged in various research activities throughout the year. 


The field of Advanced Sciences along with Computing Science and Engineering plays a 
crucial role in uplifting the standard of living, national security and overall development and 
the field of Language and Linguistics is very closely connected to both these domains. The 
true nature of scientific queries and computational modeling of language finds its 
appropriate place in various aspects of communication and collaboration of these fields will 
foster real research. It is heartening that both SASL and SCSE, have been working in the 
path of research and have successfully set up an environment for its professors, research 
scholars and students to grow and achieve new heights. Attending conferences like these 
kindles the thirst for new knowledge and exposes one to the latest trends in the field of 
research. One can also gain knowledge about new techniques, new types of equipment, 
data that is yet to be published and investigations that are still in their infancy. 


I would like to congratulate the members of the SASL, SCSE and the members of the 
organizing committee, in particular, for their endeavors in organizing a conference of this 
scale. My best wishes to all the delegates. 


I wish the-conference all success. 





Dr. G. Viswanathan 


pansy - 632 014, Tamil Nadu, India. Tel. : +91 416 2243100 Fax : +91 416 2243092 E-mail : chancellor@vit.ac.in 
Chennai Campus : Vandalur - Kelambakkam Road, Chennai - 600 127, India. Tel.+ 91 44 3993 1555 Fax: +91 44 3993 2555 


www.vit.ac.in 
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SANKAR VISWANATHAN 
Vice President 


MESSAGE 


I am happy to note that the School of Advanced Sciences and Languages (SASL) and the 
School of Computing Science and Engineering is organizing the First Chapter of The 
International Conference on Communication Computation and Cognition (IC3C 2021) in 
collaboration with the School of Languages and Linguistics of Jadavpur University, Kolkata, 
West Bengal, from January 29th to January 31st, 2021. 


Communication is primarily the window to the human mind and behavior. When we try to 
penetrate someone's mind through this ‘window’ the question that arises is ‘How does a 
speaker achieve a certain communication goal by using certain means and methods in a 
particular communicative situation?’ A conference like JC3C 2021 offers an ideal platform for 
researchers, practitioners, and educators to present and discuss more on pertinent 
questions and issues like this. I am sure that this conference will prove to be an ideal place 
for all the researchers to showcase their ideas and also benefit from the expertise and 


knowledge of others. 


The Souvenir of the First Chapter of International Conference on Communication 
Computation and Cognition (IC3C 2021) of VIT BHOPAL is replete with the valuable 
messages of our patrons, inspiring stories of eminent scholars along with abstracts of the 


presentations. 
I congratulate the organizers for their tireless efforts in organizing a conference like this. 


Best wishes for the success of the event. 


Vellore - 632 014, Tamil Nadu, India; Phone: 91 - 416 - 2243091 (10 Lines) Fax: 91- 416 - 2243092 
E-mail: registrar@vit.ac.in 
www.vit.ac.in 
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28th January 2021 
MESSAGE 


Iam immensely pleased to learn that the School of Advance Sciences and Languages (SASL) 
and School of Computing Science and Engineering (SCSE) of VIT Bhopal University in 
collaboration with the School of Languages and Linguistics (SLL) of Jadavpur University, 
Kolkata is organizing an International Conference on Communication, Computation and 
Cognition (IC3C-2021) from 29" January to 31° January 2021 in Virtual Mode. 


Everyone understands that the knowledge of computers is required for every walk of our life. 
The application of computers covers all discipline of science, engineering, technology and 
languages. Further, the employability is increased with expertise in computers knowledge to 
work in a variety of fields from aircraft design to energy infrastructure to weather prediction 
and climate research Computational power is used to make artificial intelligence, 
communication in terms of capture and transfer the information including the speeches. The 
theoretical and practical knowledge of advanced computer programming techniques and 
intelligent systems will help to understand the human and non-human cognitive and 
communications. 


I am confident that this International Conference on Communication, Computation and 
Cognition (IC3C 2021) would provide a common platform for all students and researchers in 
the area of languages and computer engineering to have discussion and share their knowledge 
and experience. I extend my hearty congratulations to the organizing secretary and organizing 
team of the conference for their great initiative. 


I wish the conference a great success. 
va) — 


P. Gunasekaran 


Kothrikalan, Bhopal-Indore Highway, Sehore Dist, MP-466114 
Email:vc@vitbhopal.ac.in; Phone: 91-7560-254575; Web:www.vitbhopal.ac.in 
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Date: 28" January, 2021 


Dr. Samir Karmakar 
Director, School of Languages and Linguistics 
Jadavpur University 


It gives us immense pleasure to be part of International Conference on Communication, 
Computation and Cognition (IC3C 2021) which is organized by School of Advanced Sciences 
and Languages and School of Computing Science and Engineering of VIT Bhopal University in 
collaboration with School of Languages and Linguistics of Jadavpur University, Kolkata, India. 


The complexities involved in the study of Cognition, Computation and Communication seek the 
active collaboration among the different academic disciplines as well as between the Institutions 
and Universities. We are lucky enough to have supports both from the academics and the 
Institutions in spite of the fact that IC3C is initially conceived by two individuals. In due course 
of time, we received useful inputs (both academic and administrative) from our colleagues. 


We strongly believe that IC3C will provide an opportunity to share ideas with researchers from 
different universities, across the world. We appreciate the efforts of the organizers in carefully 
planning and ably executing the plans to make the event a grand success. 


I wish the conference all success. 
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A Study on the Affective Factors in Writing Skills of the Second 


Language Learners at Tertiary Level 
K. Jefferson’ and V. Radhakrishnan? 
'K.S.Rangasamy College of Arts and Science (Autonomous), India 
*Kongu Engineering College, India 


Keywords: Abstract 
Writing skills 


Affective factors Strategies Even after undergoing years of education through the medium of English, 


Solutions. Second Language learners at the tertiary level find it quite challenging to 
write a piece of text without errors. This creates everyday problems in their 
academic and career life. It is seen that schools train their students to produce 
desired results in exams, particularly in core subjects instead of preparing 
them to face the world on their own. This kind of training which spoon feeds 
the students to achieve the goals of schools puts a strain on the cognitive 
ability of students to think on their feet. There are several other underlying 
factors that affect the learners’ ability to write in English. This paper tries to 
identify the issues that hamper the second language learners from acquiring 
the required level of writing skills. This paper also tries to address the 
problems faced by students by providing workable solutions and strategies 
to improve their writing skills. 


1. Introduction 


Language skills are significant for the second language learners especially to those 
studying in technical institutions. Acquiring a good level of proficiency in language skills 
enhances the communicative competency of the learners. The four prominent skills of English 
language are listening, speaking, reading and writing and to enhance those skills and to be 
proficient in using those skills learners need special attention. Among the four skills, writing is 
an important productive skill which is very difficult for the second language learners to master. 
It is a psychological process that needs lot of motivation and enthusiasm for achieving better 
result. Learners face many challenges and hindrance in language acquisition because of the 
different affective factors such as personality, attitudes, anxiety and motivation. The 
aforementioned are the most important affective factors in writing skills. This paper focuses on 
how anxiety and motivational factors affect the learner’s writing. Several affective factors were 
identified through the student’s response with the help of a questionnaire. Appropriate strategy 
needs to be identified to overcome these challenges and improve the writing skills of learners 


1020442. 
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at the tertiary level. Every individual needs writing skill as it is very important in their daily 
activities and competence in writing continues to be essential when the individuals become 
mature and working adults (Fakeye and Ogunyemi, 2010). 


1.1. Literature review 


According to Kolawole (2003), among other language skills writing skill occupies a 
more prominent position and it is very important in academic advancement. Also it involves 
more cognitive process. The learners come across many challenges while writing a piece of 
text. The two main factors that affect the writing skills of the learners are motivational factors 
and anxiety factors. Do“rnyei, (2001) stated that in language acquisition motivation is an 
essential element and also a dynamic process subject to constant change. It is a complex 
construct, “with many components such as interest, intrinsic motivation, task values, self- 
efficacy, and goal orientations” (Ong, 2015, p. 137). Anxiety is one of the affective factors that 
creates stress and worry to learners while writing and also leads to avoid or give up writing 
(Blasco, 2016). Hassan, (2001) said that situation specific anxiety leads to general avoidance 
of writing behavior. Writing anxiety is otherwise called as writing apprehension or blockage 
which is defined by Ibrahim and Noor (2011) as a subjective complex of attitudinal, emotional, 
and behavioural interaction which reinforces each other. In addition, it directly affects the 
complexity, accuracy, and fluency of L2 learner’s writing performance (Zahibi, 2018). 

1.2. Research questions 

1. Is the proposed strategy suitable to identify solutions to the select affective factors in 
writing skill? 

2. What extent will it facilitate the learners in enhancing the writing skills at the tertiary 
level? 

1.3. Limitations of the Study 

This study is limited to the tertiary level learners in an engineering institution. 

Questionnaire and the discussion are focused on the selected level of learners. 


2. Tools used 


Questionnaire was given to the tertiary level students to identify the affective factors 
that are really affecting their writing skills. Questions were framed to check the motivational 
and anxiety oriented affective factors. 


Figure: 1 


4. Your interest to write something in English 


80 responses 


@ Excellent 
@ Good 
@ Average 
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The figure: 1 shows the results of the questionnaire asked to 80 students about their 
interest in writing a piece of text in English language. The result is seen as 63.7% of learners 
expressed their interest towards writing is good that shows the reality of the requirement of 
writing skills at the tertiary level engineering graduate and for future profession. At the same 
time, 28.7% said their interest as average to write. Finally, only 7.5% students said as they have 
excellent interest in writing something in English. From these results the researcher can 
understand that the students’ level of interest to write in English is fair and also the picture 
revealed that the learners have some obstruction to write in English. 


3. Identification of Problems 


A questionnaire was given to identify the major affective factors that affect the writing 
skills. This question was asked in particular to check the levels of affective factors and also it 
covered the two major affective factors as motivational and anxiety related issues in producing 
a text which is mentioned in figure:2 
Figure: 2 


5. What prevents you to write (Select as many as relevant to you) 


80 responses 












Boredom process 14 (17.5%) 


Lack of Ideas 17 (21.3%) 
Less knowledge about English —16 (20%) 
Less word stock 23 (28.7%) 
Worried about mistakes —35 (43.8%) 


Lack of self - confidence 20 (25%) 





Slackness 4 (5%) 


Lack of motivation 18 (22.5%) 


fe) - 10 20 30 40 


From the above picture researcher is able to locate the prime factors that affect the 
learners of second language. Some of the attitudinal issues are boredom process and slackness. 
These two received 17.5% and 5% respectively. This result is actually low in comparison with 
other factors anxiety and motivation. Motivational factors are lack of self-confidence and lack 
of motivation that received 25% and 22.5% respectively. The major hindrance to write is 
anxiety related issues that are clearly seen from the percentages for lack of ideas 21.3%, less 
knowledge about English 20%, less word stock is 28.7% and 43.8% students said as they are 
worried about mistakes. Lack of ideas, less knowledge about English and less word stock are 
the back stage reason for the prime issue worried about mistakes. The three problems lack of 
ideas, knowledge and word stock that really makes the learners more anxious in writing. For 
the discussed problems some of the new strategies are proposed in the following paragraph 
methodology and discussion. 
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4. Methodology 


To overcome the above identified barriers better strategy is required to enhance their 
writing skills as well as some motivational activities. Since, the students have several 
psychological impediments as described in the above passages the teachers at present ELT 
scenario are in need of some transformational methods to improvise writing. This study 
proposes graded learning strategies and some activities to overcome the affective factors that 
are discussed earlier. Language learning should take place along with comprehension, interest, 
creativity and less anxiety. 

Graded learning that proposed the idea of learning something gradually from basics to 
the core of a specific subject. Figure: 3 describes the proposed learning methodology in three 
steps. At first contextual words are to be introduced and familiarized; and then relevant 
situational phrases and various grammatical structures are taught. In the second step, learners 
are expected to use the words, phrases and grammatical structures while speaking in a context. 
And finally, learners are asked to write a paragraph on the same theme. 

Figure: 3 — Method to Enhance Writing 


STEP -I STEP - I 


¢ Learning Words and « Apply in context during 
Phrases in context conversation 


STEP-III 


« Apply words and 
phrases in writing. 


Ca Gse-vonvoer-ve 





In addition, three activities are proposed to enhance writing skills namely increasing 
word power through pictures, group discussion and case study. 


4.1. Word Power through pictures 


Thematic pictures, for example, ‘Traffic’ can be given to the students to generate words 
which are to be used later in writing a paragraph. 


4.2. Group Discussion 


Group discussion topic based on same theme can be assigned for students to discuss. 
This activity will help the students to generate and collect idea about the topic and get 
familiarize with words, phrases and grammatical structures. 


4.3. Case Studies 


Finally, case study can be given to the learners to understand the issues and challenges 
faced by the people of their city due to traffic congestion. Students are asked identify the issues 
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by interacting with the residents of the city. To support their points photographs of traffic issues 
in a city can be captured. After completing the discussion, students can also share their ideas 
with their classmates about the issues they found in their locality. After the peer discussion, 
students are expected to compose a report on the case study 

Processes that are discussed in this paper is designed based on the learners’ needs as 
per the process product approach. Steps involved in this process are given below; 


Figure: 4 an excerpt from https://www.thinkport.org/grr/in-action.html 


Teacher Responsibility 










Focused 


Instruction "I do it." 


Guided 
Instruction 
Qf fonsnsiaive "You do it together." 
\ / Independent 
Learning 


Student Responsibility 






"You do it alone." 





The role of teachers in enhancing the writing skills of tertiary level learners is very 
challenging. A shift from the conventional way of teaching to Gradual Release of 
Responsibility (GRR) is very important to meet the need of the current generation learners. It 
is a framework that basically changes the role of a teacher as a facilitator. It has the freedom to 
make a stage for the learners to work together, to build capacity of the learners to think back 
and forth to access the required result. It makes the students’ learning process as a dynamic and 
responsive to their needs. This framework can be applied in graded learning strategy to enhance 
the writing skills using focused and guided instructions to the learners. As mentioned in the 
figure 3, focused and guided instruction is given to the students to develop vocabulary at first, 
discussion on the given theme is an instance for collaborative learning and finally learners are 
expected to produce text on their own is independent learning. This GRR platform helps to 
overcome the motivational and anxiety based affective factors and also that boosts the 
confidence level to acquire their required skill. 


5. Discussion 


In figure.2, shows various difficulties faced by students to write or produce a piece of 
text in English. Students doing professional courses are definitely in need of writing skills to 
achieve success in their academic as well as professional life. From figure.2, it is very clear that 
17.5% of students feel writing skill is a boredom process. It is because of the reason it consumes 
time and needs patience to produce a precise text. Students also feel that they spend less time 
for reading in the present scenario and lot of distractions stop them from concentrating the 
structure and form of the text, in depth analysis of a text while reading which are some of the 
reasons for affective factors like lack of ideas (21.3%), less knowledge about English (20%) 
and less word stock (28.7%). Learners’ anxiety (43.8%) is higher because the students bother 
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too much about the mistakes. This is because of the reason that English is taught as a subject 
which involves result oriented or mark oriented teaching. Focusing on results make the learners 
get tensed and worried about mistakes because students feel that if they make mistakes that will 
affect their marks. Instead of focusing on the marks during evaluation in language teaching, 
how far the learners are able to produce language can be checked by what students can write in 
English as it is used in Common European Framework Reference (CEFR) based language 
assessments. 25% of students say that they lack confidence to write because of their problems 
in spelling, grammar and language structures. In classrooms students get less motivation 
(22.5%) to write because of lengthy lectures and less activity inside the classroom. 

The above issues are encountered by proposing new strategies to enhance language 
proficiency with the specific focus on writing. Language acquisition is educated based on the 
required needs of the learners. It is practiced in context using Graded Learning Strategy along 
with the Gradual Release of Responsibility (GRR) for the better outcomes productive 
communication. The benefits of the proposed idea are learning words and structures in context, 
applying in conversations and discussions and writing an error free paragraph. Providing 
thematic topics will help the students to become more familiar with the words, phrases and 
grammatical patterns. Using same theme for speaking and writing will aid the students to learn 
the skills better in context and also have good memory retention. 

The activities that have been introduced in this paper helped learners to develop ideas, 
confidence on using language, reduce anxiety level, avoid boredom process, reduce mistakes, 
avoid slackness and learners get motivated to write. The psychological issues of a language 
learner in writing are clearly identified and rectified. The proposed method helps learners to 
enhance their cognitive ability to write better because of the activities. Several solutions are 
given through the suggested methods and strategies to overcome affective factors anxiety and 
motivational factors. 


6. Conclusion 


In this paper entire focus is given on major two affective factors in writing like 
motivational factor and anxiety factor. These two factorial issues were identified from the 
learners at the tertiary level in an engineering college. Graded learning activities to enhance the 
writing skills were proposed along with Gradual Release of Responsibility framework to 
facilitate teachers to provide learning platform for the learners. 
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Ecolinguistics 

Eco-Critical Discourse Ecolinguistics is an area of study that involves the combination of two 
Analysis disciplines namely Ecology and Linguistics. Eco-critical Discourse 
Frames Analysis (ECDA) is one of the approaches to Ecolinguistics which involves 
Meat Metaphor critical discourse analysis of texts that impact our thoughts and behaviour, 


with regard to the ecology. Our thoughts, concepts, ideas, ideologies and 
worldviews are shaped through language and they in turn influence how we 
treat each other and the natural world. Ecolinguistics shows how linguistic 
analysis can help expose the “stories we live by”, open them up to question, 
and contribute to the search for new stories (Stibbe, 2015). Linguistics 
provides tools for analysing and revealing the stories that lie hidden 
between the lines of the texts, questioning these stories from an ecological 
perspective, and then finding ways of resisting or promoting them, 
depending on whether the stories are destructive or beneficial to the eco- 
systems on which life depends.The present study focuses on one such story 
of Metaphors. It aims to identify and analyse ways in which Metaphors are 
used by food shows specifically with reference to animals and the 
consumption of meat. Certain species of animals, those which are 
portrayed and marketed as gourmet in a set of popular food shows on 
television are of particular interest for the purpose of this study. The meat 
industry is responsible for the ecologically damaging effects of intensive 
animal farming involving consumption of large amounts of resources and 
significant pollution. This paper maps and analyses multiple destructive 
metaphors about meat consumption which are positively advertised by 
popular food shows such as; Man vs Food, Carnivorous, Bizarre Foods 
with Andrew Zimmern, Food Paradise and Food _ Paradise 
International. The framework used in this study, draws on Stibbe (2015), 
which differs somewhat from the most common way of describing 
metaphor in cognitive science (Lakoff and Johnson, 1999). Here metaphors 
are treated as a type of framing — one where the source frame is from a 
specific, concrete and imaginable area of life which is clearly different 
from the target domain. The target domains are established based on the 
metaphors observed in the language used in the food shows. In this study, 
the concept of 'meat' is the target domain, which is mapped onto by various 
source frames. A total of twelve metaphors have been identified and 
discussed. 
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1. Introduction 


The conceptualisation of the world around us is influenced by the way we think about 
the events and entities that are a part of it. Language plays a key role in influencing how we 
think and in turn, how we act based on our thinking about the world. Ecolinguistics 
combines the two disciplines of ecology and linguistics. In a talk given by Einar Haugen in 
1970, the term 'Ecolinguistics' first came into existence. According to Haugen (1972), 
'Ecology' is understood metaphorically and transferred to ‘languages in an environment. 
Alexander & Stibbe (2014) use the term ‘Ecolinguistics’ to talk about how language is 
involved in forming, maintaining, influencing or destroying relationships between humans, 
other life forms and the environment. Ecolinguistics involves the study of the role of 
language in a range of issues such as climate change, conservation or destruction of 
ecosystems, new practices at curbing language death, cultural or environmental 
degradation, and celebratingthe symbiotic relationship between ecology and all the factors 
influencing sustainability positively. According to Stibbe (2015), Ecolinguistics, more 
specifically Eco-Critical Discourse Analysis studies various types of texts and discourses 
which have the potential to encourage people to think and act in ways which damage or 
preserve ecological systems. 


The advent of food shows on television has aided the framing of food to evolve over the 
years. Food shows, in order to run successfully on television promoted the domination of 
appearance over taste, or for that matter, even health. This made the concept of buying food 
instead of cooking it at home a viable option, by extending the function of food beyond that 
which provides the body with nutrients. Commercializing food and projecting it in attractive 
ways, little by little introduced norms of its own. These norms or patterns of thinking about 
food have a range of effects, most of which have a negative impact on not just our health but 
also on the environment. This paper studies the effects of the discourse of food shows that 
promote the consumption of meat and the implications it has, not only on the way in which 
animals are portrayed and treated from a utilitarian perspective, but also the way in which 
certain species of animals are marketed as gourmet food. The meat industry is responsible 
forthe brutal treatment perpetrated towards animals by keeping them in terrible conditions 
and treating them as mere 'meat products'. Added on, are the excessive ecologically 
damaging effects of intensive animal farming such as, consumption of a large amount of 
resources, increase in green house gases and the resulting pollution. The food shows 
discussed in the paper use discourse that frequently promotes the consumption of meat in a 
variety of ways. 
Man vs food is a popular American television food series taking on competitive eating 
challenges in different food joints across the United States. Food Paradise is a widely 
popular show featuring the best places to eat in America. Food Paradise International is 
based on a similar premise as its predecessor Food Paradise. The only difference is that the 
host aims to discover special cuisines of various restaurants across the world, instead of just 
America. Bizarre foods with Andrew Zimmern, is a travel and cuisine show where the 
host travels the world to try out regional cuisines of different countries. Carnivorous is a 
food show where the host meets up local farmers, butchers, ranchers and cooks to find out 
more about their favourite 'meat recipes’. 


2. Metaphor 


In Cognitive Linguistics, a metaphor is described with respect to two concepts 
namelythe source domain and the target domain. Metaphor is viewed as a mapping across 
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conceptual domains, i.e. from source domain to target domain (Lakoff and Johnson, 1980). 
The relationship between metaphors and frames can be understood as: Metaphors use a 
framefrom a specific, concrete and imaginable area of life to structure how a clearly distinct 
area of life is conceptualised. The two concepts overlap, however, and are frequently used 
interchangeably. Thus, is makes sense to talk of metaphor as a mapping from a source frame 
to a target domain. Therefore, the framework used in this paper is similar to the one stated in 
Stibbe (2015), where metaphors are a type of framing — one where the source frame is from a 
specific, concrete and imaginable area of life which is clearly different from the target 
domain. 


Metaphors can be considered to be powerful devices which can influence our minds by 
restructuring how we think about particular areas of life. Keeping this in mind, the conceptof 
'‘meat' is the target domain, which is mapped onto by the various source frames that will be 
discussed in this section. The animals consumed as food are collectively referred to as 
‘meat’, for the purpose of this study. The analysis of the five food shows brings forward a 
range of metaphors, which are discussed in the following sections, and which 'exploit' the 
domain of 'meat' in a multifarious manner. 


2.1. MEAT IS AN ART 


The 'MEAT IS ART' metaphor is prevalent in the discourse of the analysed food shows. Art 
can be defined in two distinct ways. In one case, it refers to a finished work or product, 
whereas alternatively it can also refer to the process of creating a work or product. 
Generally, entities such as paintings, sculptures, photographs or craftworks can be 
characterized as archetypal examples of 'artworks' which are created or produced by agents 
such as painters, sculptors, photographers, craftsmen or altogether by ‘artists’. Keeping in 
mind these common ways of understanding terms such as 'artwork' and ‘artist’, script writers 
of different food shows exploit the domain of ‘art’. At the least destructive end of the 
mappingof ‘art’ as 'meat', would be looking at cooks or chefs as artists who create new dishes 
using meat. Just as a painting is appreciated based on its creativity and visual effect that it 
has onthe viewer, meat is talked of on basis of the same elements with the dimension of taste 
added on. Take for example the following examples: 


1. It takes every second of that 36-hour process to turn this prime pork belly into 
asucculent work of art. 

2. A 2-inch-thick rib roast from Australian-bred cows so succulently marbled it 
belongs in a museum. Look at this! That is one fine piece of beef. 

3. This tastes like a delicate veal sausage,...this is an amazing piece of art...oh is 
a Michelangelo kind of masterpiece. 

4. On top of the pork come the French fries, that's right folks, fries come inside 
thismeat masterpiece. 


Here, the target domain is meat and the source frame is art, as in the concrete entity.In the 
above examples, the finished product which is pork belly, a rib roast, a blood sausage and a 
pork sandwich respectively, are compared to a piece of 'art', i.e. parts of animals and actual 
blood in the case of the blood sausage are considered to be as valuable and noteworthy as a 
masterpiece created by an artist. There is an objectification of the inanimate parts of a dead 
animal along with simultaneously, bringing up the frames of an artist and a cook/chef, a 
metaphorical reasoning pattern is set up. A slightly different interpretation is the process of 
making an artwork such as a sculpture which too is referred to as ‘art’. Just as tools mainly 
such as chisels and hammers, are used to carve a sculpture, tools like knives are used to cut, 
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slice or carve food items, particularly 'meat'. More appropriately, the end product being the 
sculpture is compared to a meat featured dish with lexical items such as 'art of carving 
meat', 'art of stacking meat’, 'meat marbling' 'carved paper-thin'. Here, the target domain is 
meat and the source frame is that of an artist. Consider the following examples: 


5. And he's an artist, so of course there's an art to cooking his steak. 

6. Known as the home of deboned chickens, a bear's has perfected the art of 
stuffingvirtually every kind of meat imaginable. 

7. ...nothing left for bread but to roll up his sleeves and to get a little Sun on the art 
ofmeatloaf making. 


The clauses ‘art of cooking steak', ‘art of stuffing meat’ and ‘art of meatloaf making’, 
showcase art as not just the meat product, but also, as the process involved in making it. 
This further entrenches the notion of 'meat as art’. This mapping of cooks or chefs being 
sculptors or artists is further entrenched by referring to butchers as 'professional carvers of 
meat'. The metaphor 'BUTCHERS ARE SCULPTORS' or 'COOKS ARE SCULPTORS' 
emerges as a consequence. Alternatively, the skills of a butcher are compared to the skills 
of a surgeon who makes incisions and performs surgery. Here, the source frame is that of a 
doctor. 


8. It's customary to entertain the guest with the most lavish banquet possible. That 
means sacrificing one of the family's goats. What happens next is the most 
impressive display of cutting skills I've ever seen outside of a hospital operating 
room. 

The conceptualisation of 'Butcher stores' as being equivalent to ‘art galleries’ is a 

metaphorical reasoning pattern that is based on considering butchers or cooks as artists or 
sculptors. The following examples from Food Paradise International illustrate this point: 


9. It's almost part butcher store, part like an art gallery. 
10. Dig further into this meat gallery, and you'll find other gems, too, like the Wagyu. 


Here, the source frame is an art gallery. A butcher store and a restaurant are comparedto an 
art gallery based on the conceptualisation of butchers as artists. Just as pieces of art are put 
on display for sale, cuts of meat occupy the same place at a butcher store and at a 
restaurant. Therefore, the cuts of meat are considered as pieces of art, preparing meat or 
meat dishes is considered as an artistry, cooks, chefs or butchers are considered as artists or 
sculptors and places where meat is sold or served is considered as an art gallery. 


Artists use different colours to paint a picture and cooks or chefs use different food items to 
make a dish. Therefore, cooks or chefs have also come to plate dishes in ways that appeal 
to the eyes of the eater. At times, to plate a dish in a way that the chef desires, the 
ingredients of the dish are manipulated. In the case of meat, it is cut in different ways or is 
produced in ways that appeal to the chef. Animals are genetically modified, or are raised in 
ways that go against the norm. Experimentation with food leads to remodelling of current 
practices and coming up with new, better measures of meat production. To get the perfect 
piece of meat or to make the process of slaughtering easier, animals are kept in confined 
spaces which affect their health and life adversely. Intensive animal farming is a major 
sourceof greenhouse gases (GHGs) and other pollutants. There is an increasing production 
and consumption of meat in Western countries and other countries with existing meat 
cultures asa result of increased affluence, urbanization, transport, and refrigeration and 
freezing technologies. The conveniently available meat products along with the inescapable 
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reality of television and social media who actively promote meat as an essential 
commodity, makes the act of purchasing meat products almost mechanical. The discourse 
of texts such as advertisements, manuals, news reports and others have hidden metaphors 
making consumers of meat become uncritical, in a sense, about the act, process and 
treatment endured by many animals as well as the destructive effects it has on the ecology. 


2.2. MEAT IS A BRAND 


Companies are known to be represented by an identifying symbol, logo, name or word that 
distinguishes their products from those of others. Meat is a food product that is sold at 
various food stores and is often branded in ways that appeal to the buyers. Different types 
of meat products are also thus, identified based on their brand. The term ‘designer’ in the 
analysed examples, has two variant meanings. In one case, the meaning of the term can be 
linked to the prototypical definition of a product made by a designer. Whereas, the other 
interpretation is based on the meaning of ‘designer meat' which can be defined as meat that 
is nutritionally manipulated. In the following examples, meat is equated to a product made 
by a fashion designer. 


11. This is more than just a high-end butchery. It's where you find luxury beef 
atdesigner prices. 

12. We start with a different kind of oyster these are farm raised oysters. I like to 
callthem they're Gucci oysters 

13. I'm gonna give you something that no one else in the world has it's called prized 
Wagyu designer steak. 


Just as designer clothes or jewellery are considered to be expensive and luxurious, beef is 
metaphorically described as a designer product that holds a similar meaning. Designer 
clothes are considered to be high-end as compared to other clothes and thus, designers 
possess a prestigious status. In the twelfth example, oysters are referred to as 
Guccioysters, to imply that they are designed (farm-raised) and thus, equated to a famous 
fashion brand named Gucci. In the thirteenth example, 'designer steak' is used to refer to 
cultured meat that is fortified with health promoting ingredients. It is also described as 
being expensive and exclusive which is what gives the steak its designer status. Therefore, 
in these examples, the target domain is meat whereas the source frame is a brand or a 
designer product. This conceptualisation gives rise to the 'MEAT IS A BRAND' metaphor. 


2.3. MEAT IS A CAR 


A car is a vehicle used for transportation. The metaphor 'MEAT IS A CAR' emergesin the 
following examples, where either an animal or its meat is referred to as a car. In the 
fourteenth example, a breed of cattle is referred to as the Cadillac of Italian cattle. A 
Cadillac is considered to be a luxury car and thus, by referring to one breed of cattle by its 
name, the meat of these cows is indirectly deemed as being better. In the fifteenth example, 
the term 'test-drive' is used for a piece of meat. The activity of trying out a steak is 
compared to taking a new car for a drive before buying it. Example sixteen is similar to 
example fourteen where apiece of meat is metaphorically compared to a luxury brand of 
cars. A Rolls-Royce like a Cadillac is considered to be one of the best cars in the market. 


14. For this Italian beef tagliata, Chef Maggio uses Fassone piemontese. It's the 
Cadillacof Italian cattle from the hills of Northern Italy. 
15. Want to test-drive these extravagant steaks? Let's see if they're worth every penny. 
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16. The steak at Victor Churchill is probably the Rolls-Royce of steaks that you can get. 


Here, the source frame is that of a car which is mapped onto the target domain ofmeat. 
The use of brand names in some respects is similar to the use of designer brands that were 
analysed in the previous section. The purpose behind their usages is to simply brand or 
create a prominent identity for the described meat products, so as to promote their 
consumption. 


2.4. MEAT IS A DRUG 


The 'MEAT IS A DRUG' metaphor emerges from the use of the term 'crack' for meat. 
Crack is a type of drug made up of cocaine. In the following example, grilled chicken is 
compared to 'crack'. Also, the urge felt by a drug addict who would be unable to resist 
drugsis also compared to a person being unable to resist having a chicken dish, if he went 
walking by the claimed restaurant. 


17. And wait until you taste their spicy, smoky, crispy rotisserie grilled chicken that's 
so freakin' delicious, you'll be begging for more. It's like crack, you know? Like, 
how can you walk past it and not have some? 


When animals slaughtered for human consumption are referred to as designer products, 
cars and as a drug, they are objectified in ways that categorise them as objects, with not 
feelings or life. They are conceptualised as raw materials available for utilization. Nature 
and natural resources, except humans, are viewed only as a means to an end. Brands, cars 
anddrugs are considered to be luxuries by humans. Similarly, meat is also posited in ways 
that makes it sound like a luxury that people should aspire for. 


A person who buys branded clothes or accessories is perceived as having better taste, and 
also as someone who is doing well or better than others in life. Possessing a car has become 
as important as possessing a house. People dream about buying a new car but somehow, the 
desire doesn't always end with having just one car. We want more cars or better cars. We 
abuse drugs just as we abuse animals for meat. Metaphors that link meat with objects such 
as designer products, cars and drugs influence and encourage people to consume meat. You 
may not be able to afford expensive clothes or cars, but you can certainly eat meat instead 
and achieve similar status. 


2.5. MEAT IS A TREASURE 


The target domain of 'meat' is structured by the source frame of 'treasure' and its variants, 
which gives rise to the 'MEAT IS A TREASURE' metaphor. Consider the following 
examples where certain animals are referred to as 'treasure' and ‘jewels’, ‘precious’, : 


18. It's the flavour from that diet along with this breed's perfect marbling of meat and 
fatand three years of precise curing that make Jamon Iberico such a treasure. 

19. All over the Caribbean one of the most treasured treats that comes out of these 
waters is conch a giant snail like creature 


Meat of various animals is talked of as being a treasure due to various reasons, such as the 
processes that it has undergone over a period of time, its rarity and lesser availability, or 
the taste of its meat. In the following examples, shellfish and a dish made from different 
animal parts are referred to as ‘little jewels' and 'crown jewel’. 
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20. Selena a local diver who pulls in mounds of shellfish every single day ... Rodolfo 
finds these little jewels in about 40 feet of water just off the coast 

21. Then the crown jewel the kokoretsi oh, there's what I'm after, this really is a work 
of art... This kokoretsi, the lungs, the fat, the livers, the kidneys are so clean they're 
absolutely cooked perfectly. 


Similarly, in the following examples, pig meat is referred to as the most precious ingredient 
of a burger and a steak is referred to as the most precious item on a menu, respectively: 


22. Pulled pork is the crown jewel of this royal burger. 
23. I'm ready to go back to the basics it's time to relish the crown jewel of Buckhorn's 
menu the big steak yeah 


The fat that comes out of bacon is called 'gold' and wagyu, in the following examples, 
which is the meat of a Japanese breed of cow is referred to as a 'gem' along with other 
meats found in that restaurant. 


24. And something else magical happens in that oven as well as this renders and 
cooks, that gives me my bacon gold right there 
25. Dig further into this meat gallery, and you'll find other gems, too, like the Wagyu. 


Lastly, the skill of cutting diamonds is compared to cutting of ribs. Even the intensive 
training to become a diamond cutter is indirectly compared to years of training to become a 
master meat carver. Later in the excerpt, the carved meat is referred to as 'this diamond' and 
the reason for cutting the meat in this way is to simply 'show off. 


26. These hams are the culinary equivalent to cutting diamonds. Maestro Doris like 
Esther Vasquez received years of training to become master carver...... Wait, not only 
is the marbling so intense and so awesome that it's going to like get all juicy and 
you got this diamond 


In all the above examples, meat is considered to be a treasure of some sort. Animals 
are considered to be treasured entities only if they serve some purpose to a human's 
life, and not because of the value they add to the earth by simply existing. 


2.6. MEAT IS A CONSTRUCTION 


The meaning of the term 'construction' is the action of building something, typically a large 
structure. It can also be used to describe a building or other structure. In the example below, 
the first sense of the term is used: 


27. We're building a house of meat. She got a foundation of sausage, little chopped 
brisket, little bacon burnt ends and then somehow someway you try to fit into your 
mouth this sandwich, like your job is totally unhinged. 


Here, the source frame is the material for building a house, i.e. bricks, wood, cement etc 
and the target domain is meat. The host refers to different meats such as a sausage, brisket 
and bacon as the foundation of the dish just as different building materials are used to build 
a house. Similarly, the source frames of a tower, pyramid and mountain are metaphorically 
mapped onto the domain of meat. Consider the following examples: 


28. We're dad's pork pyramids, for the best..... The Greeks built the Parthenon, the 
Acropolis and the greatest architectural marvel known to carnivores, the gyro 
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company see the huge things of meat. 

29. It's good taste and good looks, the nautical vertical variety of the seafood tower 
two tiers, pure seafood delight. It is a tower of deliciousness 

30. This moisture tower is my Everest. With 150 mollusks down and 30 more to go, I 
have reached a point of no return. 

31. For the Beaufort, start with a toasted white bread bun add a smoky mountain of 
pulled pork and topped with coleslaw finish it 

32. Four mountains of mouth-watering meats 


In all the above examples, meat is the material that makes up the particular structure, 

i.e. a pyramid made of pork, towers made of seafood and mountains made of pork or other 
meats. These structures are utilized to imply that the dishes being talked about are made up 
of huge amounts of meat. Such metaphors promote the idea of over-consumption by 
collocating the term meat and seafood with words like mountains and tower, implying that 
the more the meat for consumption, the better it is. 


2.7. MEAT IS A CELEBRITY 


The various manifestations of the metaphor 'MEAT IS A CELEBRITY ' are visible in the 
discourse of all five shows. The qualities that generally hold true for a celebrity are mapped 
onto different kinds of meat products. The source frame of a Celebrity is evoked, by the use 
of trigger words like ‘star’, ‘sensation’, ‘hero’ or ‘legend’ to talk about the target domain, 
i.e. meat served in a food dish in restaurants. This frame negatively influences the viewers 
to think that 'meat' is supposed to be at the centre of food preparation and that only a ‘meat! 
product makes the dish special or noteworthy. There are rarely any instances observed in 
the food shows, where anything other than a ‘meat’ product is referred to as the 'star of the 
dish’, 'a sensation', 'a hero' or 'a legend’. Through this positive appraisal patterns, 
consumption of 'meat' is represented positively, implying that the particular food item is a 
star, a sensation, a hero and a legend because of the enormous amount of meat in it. This 
done by collocating the term ‘meat’ with terms like ‘newest’ and ‘sensation’ inthe phrases 
‘newest meat sensation' and 'meaty sensation’, by referring to meat filled pizza as a 'meaty 
legend’, creating collocations such 'seafood legend' - used for a restaurant that is verywell 
known for the seafood that it serves. 


In doing so ‘meat’ is given salience while the other ingredients are back grounded, as can 
beseen in the following examples from the shows Carnivorous and Food Paradise. 


33. The pork chop is our star, ... 

34. We're gonna add the star of the show, the beef ... 

35. 1 am in Vermont to explore the newest meat sensation. 

36. They're supporting all this juicy meat nice chunks too so when you bite into 
thattortilla you get a nice big meaty sensation. 

37. I've heard about a pizza that you guys make called the meaty legend. 

38. I am here at D knees a seafood legend in these parts for over 50 years 


In the following examples, the term ‘hero' is used for a sausage in the first example, because 
of the components of the sausage which are pork blood, bacon fat and pepper. In the second 
example, the tiger shrimp is referred to as the hero. The fact that different aquatic animals 
are procured from the beach for food gives rise to the name 'Muscle beach’. The tiger 
shrimp is considered as the hero of the Muscle beach due to the proportion of muscles on 
its body. 
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39. England has the Cumberland sausage - but the real hero of the sausage family is 
thebotifarra sausage. 

40. That's followed by octopus calamari and the hero of Muscle Beach. We leave 
theshell on the muscle because it's more fun to eat, this is tiger shrimp. 


2.8. EATING MEAT GIVES HAPPINESS 


Happiness is equated with feeling pleasure and contentment. It is a feeling as well as astate. 
The metaphor 'EATING MEAT GIVES HAPPINESS' emerges consistently throughout the 
food shows. Here, the target domain is meat and the source frame is the abstract feeling of 
happiness. The trigger words that give rise to the ‘happiness’ frame are ‘happy’ in 
‘keep people happy’, ‘dreams’ in ‘meaty dreams come true’, “porky- 
dreams’,‘fantasyland’,fantasies’ and ‘delights’. Consider the following examples: 


41. Over 1,500 pounds of meat are sold every day here. That's a lot of meat to 
keeppeople happy. 

42. It tastes like the happiest day when you eat bacon, it's my dream come true 

43. Usually the chef's call the shots but today is all about me and making my 
meatydreams come true. 

44, This swine-centric spot is run by Chef Geert Elzinga, who brought big, porky 
dreams with him from Germany. 

45. This bustling city loves to do things bright, funky, and over-the-top. Here at 
Butalian, it's like a Fantasyland for pork fans with 50 ways to satisfy you. 

46. Greco's street food will satisfy all your souvlaki fantasies and chicken delights 

47. Happiness is also said to be encapsulated in the feeling of love — love for meat. 


48. First time I tested meat, it was love at first bite and if you sink your teeth into 
thebrother, you'll fall in love 
49. I feel I feel great I feel full of love and corned beef 


2.9. MEAT IS A DESTINATION 


The term ‘destination’ carries the meaning of a place set for the end of a journey or to which 
something is sent. The following example is a clear-cut instance where 'meat' is referred to 
as a destination where people can take a train to. The suffix -ville in 'bacon-ville' is 
generally used to create a fictitious place's name based on specific qualities of the entity in 
question. A meat dish made up of chicken and cakes is referred to as 'a stop' similar to how 
different train stops are referred to as 'the next stop', along a journey from one place to 
another. Here, the source frame is a train station. 


50. That's the ticket on your one-way train to bacon-ville. The next stop is chicken and 
cakes a swiny spin on the classic dish chicken and waffles 


A variant of the 'MEAT IS A DESTINATION' metaphor is the 'MEAT IS A HOLY 
LAND 'metaphor which can be seen in the following examples: 


51. Country barbecue is holy land. My never-ending quest for barbecue bliss has led 
me right here 

52. But there was still one more meat pilgrimage to make 

53. We said hello to the most succulent steaks and choices jobs in this ultimate meet- 
and-greet so to satisfy your coniferous cravings make the pilgrimage to these meat 
Meccas', your stomach will thank you 
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Here, the source frame is that of a holy land or alternatively, a Mecca. In example fifty-one, 
country barbecue is referred to as a ‘holy land' which means that the place being referred to 
is an ideal spot for eating barbecued meat. In examples fifty-two and fifty-three, the term 
‘pilgrimage’ is used, which in its traditional sense means a journey made to a sacred place. 
The restaurant being talked about is the 'meat' destination in example two and the different 
restaurants that feature in a particular episode are referred to as 'meat Meccas' in example 
three. Mecca is considered to be the holiest place in Islam, while in the examples below, the 
term 'paradise' has a similar notion attached to it. Here, the source frame is a place called 
paradise. 
54. Go ahead -- get your bib on and grab your steak knives for a big slice of 
steakparadise. 
55. I discovered pork paradise in the Big Easy 


The meaning of the term 'paradise' here is a very pleasant place. The term ‘heaven’ similarly 
has the meaning of a place where blessed people of the Christian community go to in their 
afterlife. Here, the source frame is a place called heaven. 


56. I feel as though I might have died and gone to meat heaven folks 
57. I'm in whole hog heaven the flavours are just explosive. 


2.10. MEAT IS A RELIGION 


Meat is considered to be a traditional food item in many countries around the world. The 
eating and preparation of meat has a cultural significance in these societies and thus, is 
considered as a regular activity. Religion is yet another part of people's lives that involves 
following certain rituals on a regular basis. It also carries the sense of staunch belief in 
something. In the following examples, the source frame of religion is mapped onto the 
target domain of meat, giving rise to the metaphor 'MEAT IS A RELIGION'. 


58. Texas barbecue is fantastic, it's like a religion. 

59. The perfect cut the brisket is the quintessential cut of meat here in Texas, it's 
areligion 

60. If steak was a religion this would be its Cathedral now the old homestead is do 
youoriginator of the big fat steak 


2.11. MEAT IS A PERSON 


The 'MEAT IS A PERSON' metaphor arises from the fact that hosts of the food showsuse 
terms that are generally used to denote people. The two terms used to refer to males such as 
‘boys' and 'guys' are used to refer to dead bodies of animals or pieces of meat. Shrimps and 
oysters are referred to as 'boys'. Here, the source frame is that of a boy. In example sixty- 
one, as the chef attempts to coat the shrimps in the batter, the host tells the shrimps to take a 
good dip, as if they have the volition to do so. In example sixty-two, the phrase 'no oyster 
left behind' is a twist on the original 'no man left behind’. The host refers to oysters as 'them 
boys' which is a slang phrase used for a crew of people or a squad. In example sixty-three, 
oysters are referred to as 'fine boys', due to their high quality. In other examples, a piece of 
meat anda part of an animal is referred to as a 'bad boy' that has the meaning of 'a powerful 
or impressive product or item’. All these examples give rise to the metaphor 'MEAT IS A 
BOY'. 


61. I'm just hanging around teenies giant shrimp babe in a buttery bath infused 
withspices like paprika cayenne basil and garlic. Take a good dip boys soak it 
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62. What's next with these fine boy stirs just get a little bit of green onion garnish 
nicejob raw oyster time 

63. Let's see if we can get that little bad boy out of there and by bad boy I mean 
eyeballready for this 


Another variant of the 'MEAT IS A PERSON' metaphor is 'MEAT IS A GUY'. Here, the 
source frame is that of a guy. 


64. The flavour intensifies characteristics that you don't get and with just wet agent 
sowe're gonna season this guy real nicely 

65. I've got three lobster bodies I'm just gonna drop those guys right in there 

66. Those guys from there are gonna go straight over onto the grill. 

67. I got to tell you these little guys are good, the local frog stuffed with pork is 
apreparation that I haven't seen before 


In the above examples, meat is referred to as 'guy/s', which is a slang used to refer to 
aman or a group of people informally. In example sixty-six, the verb 'go' is used instead of 
‘put’, which implies a movement. In other examples, dead animals such as lobsters, 
monkfish, fertilized chickens and frogs are referred to as 'guys', beautiful guy’, ‘little guy/s. 


2.12. MEAT IS A MONSTER 


The term 'monster' that is used in the following examples carries the meaning of something 
which is extremely large and in some way scary. In example sixty-eight, it refersto the 
size of a dish that is made up of different meats, making it scary by its sheer size, whilein 
the other examples it refers to the size of a lobster. 


68. This mouth-watering monster is one of iguanas carefully prepared meats 

69. For this occasion a card is serving up the biggest lobster he has a nine-pounder. 
Wow oh my my, that's our monster! 

70. This guy is a monster. That's a two-pound rat. Sometimes the farmers will catch 30 
rats in one field 

71. Turns out there are monsters in there looking out over this peaceful water there's 
nothing to remind you of the Loch Ness monster or the Creature from the Black 
Lagoon but there are monsters in Lake Balaton... 


Here, the source frame is that of a monster. The term 'monster' has an inherently negative 
connotation since in its original sense it carries the meaning of a creature that is large, ugly 
and scary. In examples sixty-eight and sixty-nine, the term ‘monster’ has a positive 
connotation, where the size of the meat on an animal is equated to the flavourful taste that it 
has. In this sense, it can be considered as an ambivalent metaphor, since people find huge 
portions of food fascinating and prefer getting more for their buck as compared to 
justenough. If the portion of food is more, it is perceived positively and strangely, as being 
better. This gives rise to the appraisal pattern ‘BIGGER IS BETTER’ (Stibbe, 2015). As 
noted by Lakoff and Johnson (1980), this pattern is coherent with the UP/DOWN 
spatialization metaphor 'MORE IS UP’. These cultural values are deeply embedded in the 
minds of people and consequently, if entities in the world are linked to such patterns, they 
will automatically be perceived positively. Schumacher, in his book Small is Beautiful 
opposes the prevailing cultural evaluation of LARGE IS GOOD by praising the virtue of 
small in different situations. The SMALL IS GOOD evaluation is not ecologically 
damaging as its opposite LARGE IS GOOD evaluation that supports the idea or practice of 
consuming extra resources without any corresponding benefits. Similar attempts have been 
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made by the activist Vandana Shiva who collocates the term 'small' with intrinsically 
positive terms. However, in examples seventy and seventy-one, rats and a particular 
species of fish are perceived as 'monsters' inthe traditional sense. The rat is considered as 
a monster due to the fact that it destroys certain crops on the island, where people reside. 
Whereas, the fish referred to as a monster inexample seventy-one, is compared to the 
fictional Scottish Loch ness monster. In this case, it becomes a destructive metaphor. 


For particular groups of people, being able to consume a large quantity of food is 
considered as an achievement. Therefore, the ability to consume a monstrous dish is equal 
to taking down a monster. There is a sense of a war or competition that is clearly implied in 
the above examples. The host and the displayed meat dishes are considered opponents and 
thus, the notion of me versus it (the food dish) emerges. To win the battle, the opponent 
must be defeated. Nature is exploited just as animals when it comes to man's sense of 
supremacy. The viewers are dismissively prompted to eat more instead of telling them to 
eat less or perhaps, just enough. The moment people start demanding the same, more and 
more animals will be genetically modified to match their preferences, and the animals in 
the wild that are naturally bigger in size, will also be killed for the same reason. Eating 
more meat results in more animals being killed and eating big portions of meat may lead 
to wastage. The consumptionof different types of meat and meat products has substantial 
effects on people’s health, and livestock production can have major negative effects on the 
environment. 


3. Method 


The data for this study is collected via Youtube videos. The total number of episodes 
per show include: Man vs Food (27 eps), Bizarre Foods with Andrew Zimmern (19 eps) , 
Carnivorous (22 eps), Food Paradise (24 eps) and Food Paradise International (6 
eps). Thus, the total number of shows analysed are five, and total number of episodes are 98 
covering 55 hours approximately. The data is analysed on the basis of the Eco-Critical 
Discourse Analysis (ECDA) framework, applying the linguistic methods stated by Stibbe 
(2015), and thereby revealing discourses which are either destructive, beneficial or 
ambivalent from the perspective of the ecological wellbeing. The ECDA framework aims at 
analysing the stories-we-live-by and builds its framework by drawing upon various 
linguistic theories such as Critical Discourse Analysis (Fairclough 2003); frame theory 
(Lakoff and Wehling 2012); metaphor theory (Miller 2008); appraisal theory (Martin and 
White 2005); identity theory (Benwell and Stokoe 2006); fact construction (Potter 1996) 
and theories of erasure and salience (drawing on van Leeuwen 2008). The present study 
focuses on the story of Metaphor (Stibbe, 2015).This paper aims at mapping and thereby 
revealing multiple destructive metaphors about meat consumption which are positively 
advertised by the popular food shows mentioned above. 


4. Discussion 


It is worth noting that the metaphors discussed in this paper can be considered as 
different forms of entrenchment. Ordinary language is filled with metaphors of varying 
degrees of entrenchment. Metaphors can indirectly convey ideas or practices that could 
endup having a lasting impact on memory. The commonalities that are observed across the 
analysis are how eating meat is promoted, animals are objectified as a means to an end, 
excessive meat consumption is perceived positively and meat is viewed as an important 
part of life. Animals, whether farm raised/produced or living in their natural habitats are 
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portrayedas if they are either there to be eaten or waiting to be eaten. Not eating meat is 
frowned upon and viewers are made to believe that they are missing out on life altering 
experiences. 


Giving up meat completely may not always be the answer. Based on the reasons for eating 
meat, we can try to think about alternative ways that would prove to be ecologically 
favourable. If one consumes meat for its nutrients or protein content, perhaps one could try 
different types of beans or vegetables that are equally rich in nutrients. However, if one 
consumes meat because of its taste, one could possibly start by eating less meat. There are 
emerging movements that support reducing one's meat intake. The Reductarian Movement, 
founded by Brian Kateman is built on the core belief that cutting back is more effective 
than cutting meat completely. According to them, even a 10% reduction in meat 
consumption can have significant effects on personal health, the lives of 70 billion farm 
animals and global warming. One could also maybe try eating meat substitutes, i.e. meat 
analogues that have similar texture and characteristics as meat. The most crucial change 
that most meat eaters canmake is by buying meat responsibly. By choosing meat and dairy 
from ethical producers overintensively farmed produce, everyone can collectively make a 
big difference. 


5. Conclusions 


The present study aims to identify and analyze ways in which Metaphors are used by 
food shows specifically with reference to animals and the consumption of meat. This paper 
maps and analyses multiple destructive metaphors about meat consumption which are 
positively advertised by popular food shows such as; Man vs Food, Carnivorous, Bizarre 
Foods with Andrew Zimmern, Food Paradise and Food Paradise International. We 
identified and discussed twelve metaphors such as Meat is an art, Meat is a brand, Meat is a 
car, Meat is a drug, Meat is a treasure, Meat is a destination, Meat is a religion, Meat is a 
construction, Meat is a religion, Eating meat gives happiness, Meat is a celebrity and Meat 
is a Monster. Certain metaphors such as Meat is an art, Meat is a drug, Meat is a car, Meat 
is a destination and Eating meat gives happiness can be particularly considered as 
destructive metaphors since meat is represented as the ultimate, most perfect food item. 
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Quantification, The present paper is comparative and quantitative assessment of Hindi and 
Hindi, its dialect Magahi spoken in Bihar state of India. Magahi traces its 
Magahi, existence from pre-Ashokan period. The language has its origin in 
Phonological Generalization Magadhi. Bharata Muni has discussed seven major dialects (bhdsd) in his 


Natyasastra and he keeps Magadhr in the first place. The language 
experienced many ups and downs and finally in 1961 census it was legally 
absorbed under Hindi, and hence the dialect of Hindi. These languages 
share many similarities and regular dissimilarities as well. In this paper we 
discuss regular dissimilarities at phonological level and see the degree to 
which they shared similarities. To prepare our lexical database for the 
present study, we have considered lexical items from Swadesh list prepared 
by linguist Morris Swadesh. We have then listed their equivalents in Hindi 
and Magahi. The main goal is to show the similarities and dissimilarities 
based on the basic vocabularies of the language and its dialect by 
considering the different domains like body parts, color terms, basic verbs, 
kinship terms, number words etc. We have considered Magahi as base 
language and compared Hindi. Based on the exhaustive list, we have 
compared the lexical items and classified them into different categories. 
They are exact matches, minor changes group etc. We then attempted to 
find out the phonological evidences across the languages viz. gender 
marker deletion, neutralization, diphthongization, apocopation etc. Besides 
we used pie chart to present the quantification of our lexical database 
according to the different categories mention above viz. nearly 41% lexical 
items of the database belonged to exact matches group. 


1. Introduction 
“sabbe:sa.” mu:lab'a:sa.j ma:gad'a.j niruttija:” 
This line stating, Magahi as fundamental or basic language, has been written in Mahavansha!. 
Magadhi Apbhransh is the mother language of Magadht, from which present day Magahi 


came into existence. Characters of low social and economic status in many classical dramas 


1 Mahavansha (Chulvansha 37, 230, 242, 4) 


(20208. 
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such as Mrcchakatika by Siidraka, Mudrarakshasa by Vishakhadatta, Venisamhara by Bhatta 
Narayana, Abhigyan Shakuntalam by Kalidasa etc. have been noticed speaking Magahi. 
Speech of a person in Sudraka’s Mrcechakatikam has been presented below as an example. 


“cahaje: kile: je: vinitdie: n hu de: kamm vivayani:-ae: | 
pacuma:lonakammada:lune: anuka"pa:midu je:vv corltie: |” — 6.1 


Magahi did not received patronage of powerful hands later and also due to invasion led by 
Khlji in Vihar (later Bihar Sharif) libraries in 1191 C.E. and Nalanda University in 1200 C.E., 
the language suffered many ups and downs. In 1950, Hindi was made the only official 
language of Bihar (1950, Bihar Official Language Act). Finally, in 1961 census Magahi was 
legally subsumed under Hindi (1961 Census). 

Hindi is one of the two official languages of India. It is included in the 22 scheduled 
languages of the republic of India (1950, Constitution of India, Article 343). It is the third 
most spoken language of the world (Cf. Ethnologue) and the most spoken language in India 
(Census 2011). Hindi is mainly spoken in Bihar, Uttar Pradesh, Madhya Pradesh, 
Chhattisgarh, Haryana, Himachal Pradesh, Jharkhand, Rajasthan and Uttarakhand (Cf. 
Wikipedia2020). Hindi accommodates 56 mother tongues (apart from ‘Other’ section) under 
it (2011 Census). The language is written in Devanagari script. Magahi is a dialect of Hindi 
and is spoken in Bihar state: Bhagalpur, Gaya, Patna, Jehanabad, Aurangabad, Arwal, 
Nalanda, Nawada; Jharkhand state: Hazaribagh, Palamu, Chatra, Koderma, Jamtara, Bokaro, 
Dhanbad, Giridih; West Bengal state: Maldah district (Cf. Ethnologue). Magahi was written 
in Kaithi script in olden days but at present it is written in Devanagari script. Sir Grierson 
(1885) put Magahi in Bihari language (L.S.I. Vol. I, Part I, pp 120) along with Maithili and 
Bhojpuri and also conducted many studies in Magahi. 

The paper aims to prove that Hindi and its dialect Magahi are genetically related and present 
a quantitative assessment of the lexical database considered for the present study. 

The paper has been divided into six sections. It discuss Introduction, Related works, 
Methodology, Data analysis, Discussion and Conclusion. 


1.1 Literature Review 


In this section we discuss some related works by eminent scholars in the field of 
comparative grammar. These works have provided better insights to look deeper into the 
similarities and the regular dissimilarities among the cognates. The related works of Rao 
(2017), Lyngdoh (2017), Kasak (2016), Rashel (2012), Mishra and Bali (2012) are 
considered here for the review. 


Rao (2017) has worked on Evidence for the Genetic Relationship of Mongolic and 
Dravidian Languages. This work is a comparative study of the proto-form of Mongolic 
and Dravidian language families. Author provided a significant number of cognates and 
a certain number of peculiar correspondences. He has proved his claim that the two 
language families (Dravidian and Mongolian) share common origin by using his 


2 Mrcchakatikam(editor- Jayashankar Lal Tripathi ) 


Corresponding Author: Supriya Chauhan 
Email: supriyachauhanaisha@gmail.com 


24 Chauhan and Tiwari / 2021 


evidences. He has provided evidences at two levels at phonological level and 
morphological level to prove that these language families are genetically related. 


Lyngdoh (2017) has worked on Reconstructed Proto-Meitei-Chin Root Words and their 
Cognates: A Brief Survey of Consonantal and Vowel Shifts in Meitei-Chin languages. He 
has taken 230-odd PMC? roots as his database. He tried to reconstruct the Proto-Meitei- 
Chin proto-form. In this paper author has taken some reconstructed PMC root words and 
traces its survival into the two MC* sub-groups — namely the Meitei language and the 
Kuki-Chin sub-group. An attempt is also made to identify certain consonantal and vowel 
shifts which have been developed in Meitei and in Mizo. He reconstructed the Proto- 
Meitei-Chin Root Words. 


Kasak (2016) in A Distant Genetic Relationship between Siouan-Catawban and Yuchi 
has tried to show genetic relationship between Siouan-Catawban and Yuchi languages. 
Author used his earlier references like Sapir’s “Hokan-Siouan”, “A possible link to 
Proto-Gulf’ and some other works as well to work on this paper. Author discussed 
phono-lexical evidences (using sound inventory, cognates etc.) and morphological 
evidences (classifiers, pronominal morphology, template morphology etc.) to support his 
hypothesis. He has successfully proved the distant relationship between Siouan- 
Catawban and Yuchi by using his evidences. 


Rashel (2012) has worked on Standard Colloquial Bengali and Chatkhil Dialect: A 
Comparative Phonological Study. Author has provided various evidences at phonological 
level like loss of aspiration in Chatkhil dialect, consonant behaving like vowel etc. to 
support his claim. 


Mishra and Bali (2011) has worked on A Phonological Study of the Dialects of Hindi. 
Authors have presented phonological description of seven of the major dialects of Hindi, 
namely, Awadhi, Bagheli, Bhojpuri, Bundeli, Haryanvi, Kanauji, and Khari Boli. They 
have presented a comparative study of consonantal and vowel inventories of the seven 
dialects of Hindi based on the analysis of telephonic conversational speech. 


1.2 Research questions 
In the present paper we try to seek answer for the following questions: 


1. Are Hindi and Magahi genetically related speech varieties? 

2. Magahi is much older speech variety than Hindi since we find its use in classical 
dramas of Kalidasa, Bhatta Narayana many others (see section 1), how did 
Magahi contribute to Hindi? 


2. Article structure 


ww 


PMC - Proto-Meitei-Chin 
4 MC - Meitei-Chin 
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The paper has been divided into six sections. Section | discusses Introduction; it has two 
sub-sections namely Literature review and Research questions respectively. Section 2 
discusses Article structure. Section 3 discusses Method. It is further divided into four 
sub-sections namely Data collection procedure, Data analysis, Phonological 
generalizations and Quantitative analysis respectively. Sub-section 3.2 i.e. Data analysis 
is further divided into four sub-sections. Sub-section 3.3 i.e. Phonological 
generalizations is further divided into seven sub-sections. Section 4 discusses Results. 
Section 5 is Discussion and section 6 discusses Conclusion. 


3. Method 


Comparative method is the most celebrated method to establish correspondences among 
the languages compared. It is a systematic process of reconstructing the inventory of an 
ancestral language from cognate reflexes in the genetically related daughter languages 
(Wesis, 2014). We have implied the method in this paper. The lexical inputs for the 
lexical database have been taken from the Swadesh List. The list has been designed by 
linguist Morris Swadesh. It is a paramount compilation of the basic concepts for the 
purpose of historical-comparative studies. These basic vocabularies are quite common in 
any known language. It is presumed that the languages have their own equivalent lexical 
items for them and have not borrowed from other language. The list contains lexical 
items in English; we have listed its equivalents in Hindi and Magahi. The list includes 
various domains such as body parts, like ‘eye’, ‘claw’, ‘leg’, color terms, like ‘black’, 
‘red’, ‘white’, basic verbs, like ‘to come’, ‘to eat’, ‘to hear’, nominal, like ‘animal’, 
‘dog’, ‘sun’, pronominal, like ‘TP’, ‘thou’, ‘we’, adjectives, like ‘small’, ‘thick’, ‘old’, 
number terms like ‘one’, ‘two’, ‘three’, etc. 


3.1 Data collection procedures 


The lexical database consists of two hundred and nine lexical items. To create the 
database we have merged two hundred word Swadesh list and hundred word Swadesh 
list so that all the major basic vocabularies could be covered. Data has been collected 
from the native speakers of Hindi and Magahi. Magahi data has been collected from 
Jehanabad district of Bihar. We have cross-checked the data with other native speakers. 
We have considered Magahi as our base and have compared Hindi on phonological 
level. It is because we consider Magahi is much older than Hindi and its origin can be 
traced from later Vedic period. 


3.2 Data analysis 


We have done comparative analysis of our lexical database. We noticed some 
phonological similarities as well as some regular dis-similarities while analyzing the 
data. Based on the degree of similarities they share, we have categorized whole database 
into four different groups. They are: 


1. Exact Matches 
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2. Minor Changes 

3. Complex Changes 

4. Lexical Changes 

These categories are illustrated below with greater details and appropriate examples. 
3.2.1 Exact Matches 


We came across a very large number of lexical items which are similar in Magahi and 
Hindi. Some of the examples are listed below: 


Table 1.1: Exact Matches 





3.2.2 Minor Changes 

This group included the lexical items whose equivalents varied slightly. They differed in 
terms of single sound or syllable. Our lexical database has sizable number of such 
equivalents. Some of the examples are listed below: 


asubstance 


righthand 
ind rez) 


Table 1.2: Minor Changes 





3.2.3 Complex Changes 
This group included the lexical items whose equivalents were partially similar. Changes were 
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noticed in more than single sound or syllable. The database has ample amount of such lexical 
items. Some of the examples are listed below: 


pat'al patt'or ‘stone’ 
porch: 
baris vere ‘year’ 


Table 1.3: Complex Changes 





3.2.4 Lexical Changes 

Very few lexical items were seen in the selected languages whose equivalents do not share 
similarity at all. This group included those dis-similar equivalents. Some of the examples are 
listed below: 


a re ee 
P| nim | sect | goo 


ma.i" 
ochiga ‘there’ 
kak*ani: 





Table 1.4: Lexical Changes 
3.3 Phonological generalizations 


Based on our analysis and observation of the data we have tried to make some generalizations 
at phonological level. These are displayed in the tables below followed by illustrative 
examples. 


3.3.1 Neutralization 

We found the process of neutralization while analyzing our database. The distinction between 
‘’ of Magahi and ‘7’ of Hindi has been neutralized in the word final position. Some of the 
examples are illustrated below: 
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Table 1.5: Neutralization 





3.3.2 Apocopation 

Apocopation is the phenomenon of deletion of final sound. While analyzing our data we 
noticed that Hindi has undergone the process of apocopation. Some of the examples are 
illustrated below: 


ee ee 
et eee! 
one oe ee 


Table 1.6: Apocopation 





3.3.3 Gender Marker Distinction 

Based on the analysis of our database we noted that gender marker distinction is not present 
in Magahi but Hindi has acquired it, may be during the process of standardization. Some of 
the examples are illustrated below: 


mo.ta: (Mase.)/ ‘fat(substa 
mo.{i: (Fem.) nce)’ 


puran pura:na: (Masc.)/ 
puracni: (Fem.) 


sor ase 
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Yasar Rem 
clo.f, nat clo.fa:, na:f{a: 


(Masc.) | cho:fi:, 


nati: (Fem) 


clo:ta: (Mase.) / 
clo.fi: (Fem.) 





Table 1.7: Gender Marker Distinction 


3.3.4 Vowel Lengthening 

We noticed many lexical items where vowel in the word final position became elongated. 
This phenomenon of short vowel being elongated is known as vowel lengthening. The 
following examples from our lexical database represent the examples of vowel lengthening. 


ion 


Table 1.8: Vowel Lengthening 





In table 1.8, a short vowel ‘a-’ occurring in the word final position in Magahi has became *- 
a:-’in Hindi. 


3.3.5 Vowel Lowering 

Vowel lowering is the process where a high vowel becomes a mid or low vowel, or a mid 
vowel becomes a low vowel. We noticed this particular phenomenon while analyzing our 
data. Some of the examples are illustrated below: 


righthand) 
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asama.n a:sama:n ‘sky’ 


Table 1.9: Vowel Lowering 





In table 1.9, a mid vowel ‘“a-’of Magahi occurring in first syllable in example 1, 2, 3, and 4 
has become a low vowel ‘-a.-’ in Hindi. In example 5 and 6, a high vowel “u-’ of Magahi in 
first syllable has become a mid vowel ‘-o.- ’in Hindi. 


3.3.6 Diphthongization 

Diphthongization is the process of change of a single vowel into a sequence of vowels. We 
noted several examples of this phenomenon in our lexical database. Some of them are listed 
below: 


be wh 


Table 1.10: Diphthongization 





In table 1.10, monopthong ‘-o.-’ of Magahi has become diphthong ‘“a-w-’ in Hindi in 
example 1. In example 2, monopthong ~i-’ of Magahi has become diphthong *a.7-’ in Hindi. 


3.3.7 Loss of Distinction between Alveolar s and Palatal s (s and ¢) 

Magahi has lost distinction between ‘s’ (alveolar s) and ‘¢’ (palatal s). Magahi uses ‘s’ to in 
place of ‘ce’. The following examples of this particular phenomenon were noticed in our 
lexical database. 
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Table 1.11: Loss of Distinction between Alveolar s and Palatal s (s and ¢) 
3.4 Quantitative analysis 


Apart from working on finding phonological generalizations in the paper, we also tried to 
make quantitative assessment of our lexical database. Our database has been categorized into 
four groups based on the degree of similarity they shared. We tried to find out that how much 
similarity does Magahi share with Hindi and used graphical representation of pie chart to 
show the quantification. Our quantification is based strictly on our database which consists of 
two hundred and nine words. The graphical representation has been illustrated below. 





Magahi-Hindi Similarty Chart 


ld | Exact Matches 
Minor Changes 

2, Complex Changes 
Lexical Changes 











4. Results 
We identified four groups out of our lexical database and tried to quantify the database based 
on the similarities they shared. 
e The first group was Exact Matches group and approximately 41% lexical items of the 
database belonged to this group. 
e The second group was Minor Changes group to which approximately 27% lexical 
items of the database belonged. 
e The third group was Complex Changes group and nearly 18% lexical items of our 
database belonged to this group. 
e The last i.e. the fourth group was identified as Lexical Changes group and nearly 14% 
of the lexical items of the database belonged to this group. 
Besides we made certain peculiar phonological generalizations out of the database. Some of 
them are mentioned below. 
e We identified the process of neutralization where the distinction between ‘7’ and ‘7’ 
has been neutralized, like jar ‘to burn’ of Magahi became ja/ ‘to burn’ in Hindi. 
e Apocopation was found in Hindi e.g. suf ‘sleep’ became so: ‘sleep’ in Hindi. 
e Gender Marker Distinction has not been found in Magahi but Hindi seems to 
acquire it later during the standardization process. For example, do:sar ‘other’ 
of Magahi has become qusara: (Masc.)/ dusari: (Fem.) ‘other’ in Hindi. 
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e The only phenomenon i.e. loss of distinction between alveolar ‘s’ and palatal ‘s’ has 
been found in Magahi while Hindi retained the distinction. Magahi speakers seem to 
lose this distinction for the ease of pronunciation. 

e Some other processes such as vowel lengthening, vowel lowering, diphthongization 
etc. have been found in Hindi. 


5. Discussion 


Proceeding from the analysis and observation of our database, we classified the whole data 
into four categories on the basis of shared similarities and dissimilarities. Hindi and its dialect 
Magahi share many prominent features viz. gender marker distinction, apocope, 
neutralization, diphthongization etc. These phenomena have been discussed above (see 
section 3.3) in details with appropriate examples. Based on the shared similarities and 
phonological correspondences found in data analysis, we claim to find very large number of 
cognates in the data. Further, on the basis of cognates and phonological generalizations (see 
section 3.3), we answer our first research question and conclude that Magahi and Hindi are 
genetically related. We quantified the database and represented the findings using pie chart 
representation (see section 3.4) and thus answered our second research question. 


6. Conclusion 


The present paper is an attempt to do a comparative and quantitative analysis of Hindi and its 
dialect Magahi. The similarities and correspondences found out cannot be simply explained 
until and unless we presume that Magahi and Hindi are genetically related. This kind of work 
may provide base for further studies in comparative - historical study of Indo-Aryan 
languages at higher levels such as morphological, syntactic etc. and proto-language 
reconstruction. 
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ESL, Pronunciation, Speaking, In English Language, Speaking Skill is considered as one of the most 
Tasks and Methods significant skills among the four skills like Listening, Speaking, Reading and 


Writing (LSRW Skills). The professional circumstances in today’s scenario 
necessitate and endorse that this skill should be imbibed by every learner. 
The important challenges that are faced by the learners while attempting to 
acquire speaking skills are with that of pronunciation because different 
sounds are stressed in varied ways in different languages. Appropriate 
pronunciation of words is very essential to improve one’s speaking skill. The 
ESL learners, in particular have to pay more attention in learning the 
pronunciation nuances because intelligible pronunciation is one of the 
indispensable components of oral communication and therefore becomes 
inevitable for the ESL learners. Varied tasks and techniques to teach 
pronunciation lessons will always uphold the ESL learners to acquire English 
pronunciation. This paper throws light on teaching such tasks and methods 
that can be adopted by the teachers in a learning environment to the ESL 
learners so that they learn the art of pronouncing a word properly and 
confidently. 


1. Introduction 


The professional circumstances of today’s academic scenario necessitate and endorse that 
Speaking Skill is very essential and that it should be imbibed by every learner. The important challenges 
that are faced by the learners while attempting to acquire speaking skills are with that of pronunciation 
because different sounds are stressed in varied ways in different languages. Appropriate pronunciation 
of words is very essential to improve one’s speaking skill. Chan (2017) stated that learning how to 
pronounce the target language is very important at the beginning stage itself. The ESL learners, in 
particular have to pay more attention in learning the pronunciation nuances because intelligible 
pronunciation is one of the indispensable components of oral communication and therefore becomes 
inevitable for the ESL learners. In such circumstances, teaching pronunciation to ESL learners becomes 
a big challenge for teachers. According to Broughton et. al (2003) pronunciation teaching involves 
understanding the method of speech and fluency of a language. Basically, the learner needs more 
practice in listening and speaking skills which ultimately helps the skill of learning pronunciation for a 
second language learner. In order to learn the pronunciation of a foreign language, ESL learners have to 
acquire many skills. First of all the learner must be familiar with the language, learn the nuances of 
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listening, speaking and must also be familiar with the grammar aspects of the language. Further, 
Broughton et. al (2003) opine that learning pronunciation involves acquiring recognition skills so that 
the ESL learner can listen to the new language and also learn to pronounce the words properly. Hence 
the role of the teacher becomes very vital in teaching pronunciation. 


1.1. Literature review 


Teaching pronunciation has been attempted using diverse techniques for so many decades and 
various methodologies have been formulated by researches. Celce-Murcia, et. al., (1996) suggested two 
methods for teaching pronunciation; intuitive-imitative approach and the second method analytic- 
linguistic approach. In the first method the learner listens to a teacher or an audio and imitates. The 
second method employs tools such as a phonetic alphabet, articulatory descriptions, charts etc. 
McCarthy (1991) observed that pronunciation teaching in involves teaching the sounds of language 
called phonemes. Jenkins, J. (2004) stated that teaching pronunciation include consonant and vowel 
sounds, word stress patterns, rhythm and intonation. Grammar proficiency is also necessary to learn 
pronunciation. Parker, M. (2000) suggested that video and audio activities help in improving the learners 
listening proficiency and pronunciation. Bachman (1990) pointed out that pronunciation is a part of 
grammatical competence. Kelly (2000) found that the preferred pronunciation model for teaching 
pronunciation is Received Pronunciation (RP). Dalton and Seidlhofer (1994) related communicative 
abilities of the learners to pronunciation. Gilakjani (2011) pointed out that the teacher must be aware of 
the approaches involved in teaching pronunciation. Baker and Murphy (2011) inferred that training 
programs for teachers are insufficient and that necessary pedagogical training, techniques, and tools are 
needed to teach pronunciation. Begum, A., & Hoque, M. A. (2016) identified that social factors, such 
as lack of motivation by the friends or teachers to produce correct English as one of the factors affecting 
the process in learning pronunciation. Chan (2018) pointed out that pronunciation can be taught by 
identifying syllables and stress; using audio and visual methods and using total physical response 
method. 


1.2. Research questions 


The Research questions are: (i) What are all the challenges faced by the teachers and the ESL 
learners in teaching and learning English Pronunciation respectively? (ii) What are the tasks and 
methods that can be adopted to teach pronunciation for the ESL learners? 


1.3 Research Methodology 


The scope of the study is confined only to the Undergraduate Engineering College students. The 
researcher followed convenience sampling method. Since the study is an experimental study, the 
purpose of the study was clearly explained to the students. This experiment was conducted among 100 
undergraduate Engineering College Students. 


2. Challenges in learning Pronunciation 


The usual problems faced by the ESL learners in learning pronunciation are stressing the wrong 
words, pronouncing the consonant sound incorrectly, confusing long and short vowels. Further, 
intonation is also one of the essential aspects of pronunciation. But ESL learners are totally unaware of 
the pattern of the foreign language intonation. In certain cases, even the highly proficient students fail 
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to follow the patterns properly. Gilakjani (2011) mentioned that some of the challenges faced by learners 
in ESL learners include minimal exposure to the target language, learners’ difficulties in English spelling 
systems, lack of enough attention to stress, rhythm, and intonation. In addition, learning a foreign 
language for an ESL learner is not easy and it is only considered as a physical task which requires 
conscious movement of one’s tongue, throat, and lips. Also, there is no graphical enunciation to 
demonstrate these differences from one sound to another. Hence English pronunciation is generally 
considered to be difficult and confusing for second language learners. 


3. Tasks and Methods for teaching Pronunciation 


3.1 Teaching pronunciation 


Teaching pronunciation to ESL learners with limited training and exposure, in all means is a 
challenging task for the teachers. Chan (2017) mentioned that teachers of beginners play a significant 
role in helping the students to learn pronunciation. However, they can always use the traditional methods 
of teaching like, teaching with phonics exercises, teaching intonation, syllable stress, and vowel length; 
teaching different mouth positions to master pronunciation, minimal pairs, consonant clusters etc. 
Teachers can also employ the common and usual method of teaching pronunciation, i.e., to make them 
listen to an audio and practice. 


3.2 Phonics 


The most common method for teaching English pronunciation to ESL learners is through 
phonics. Phonics focuses on learning the individual sound of a letter or set of letters, vowels, and 
consonants while learning to read. Teaching minimal pairs help the learners to differentiate 
pronunciation between pairs of words. They mainly help to understand the different phonemes in the 
language. 


3.3 Minimal Pair 


A minimal pair usually comprises of two words with similar sounds but two different meanings. 
This pronunciation activity mainly helps to understand the sounds in a meaningful context. Some of the 
examples of minimal pairs are: 


/desk/ - /disk/ 

/shot/ - /short/ 

/ship/ - /sheep/ 
/hot/ - /heart/ 
/pants/ - /paints/ 


3.4 Consonant Clusters 
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The next simple task is teaching Consonant Clusters. This also facilitates the beginners to learn 
pronunciation to a better extent. Consonant Clusters refers to two consonants that appear at the beginning 
of a word and that are pronounced together. For example, 


/snail/ 
/speak/ 
/smart/ 
/brush/ 
/bleach/ 
/plaster/ 

/great/ 

/glue/ 


Similarly, teachers can explain the difference between released and unreleased vowels, stress 
and rhythm, unpronounced vowels. They can also teach the students some of the minute details in 
learning pronunciation like, the vowel that appears immediately after the stressed vowel most of the 
times is not pronounced. 


/different/ 
/vegetable/ 
/comfortable/ 
/language/ 
/information/ 
/answet/ 
/sentence/ 


4. Discussion 


Proper pronunciation is essential to master any language. In spite of good vocabulary knowledge, 
erroneous pronunciation will only make our communication a poor one. And some of the English words 
are evidently difficult to pronounce. So, it becomes essential to teach English Pronunciation to the ESL 
learners from the beginning. Further, teaching pronunciation skills has to be given equal importance like 
that of the other skills namely Speaking, Listening, Reading and Writing. In the course of learning the 
basic nuances of pronunciation especially through Phonics, the ESL students will learn to speak and 
pronounce appropriately. In addition to the training provided by the teachers, the ESL learners need to 
have interest and motivation to learn pronunciation. With the above mentioned methods to improve 
pronunciation, they can also gain knowledge about mouth positions, practice tongue twisters, listening 
and word stress and intonation. 


5. Conclusions 


Learning a foreign language itself is a challenging task for ESL learners. Amidst that, acquiring 
pronunciation skills not only demands learning Listening and Speaking Skills but also involves a 
considerable amount of understanding the ways of sound systems of the language. However, the 
organization of the sound system involves not only vowels and consonants but also stress and intonation. 
Therefore, learning phonetics by all means becomes very essential for the ESL learners. And obviously, 
the entire burden is on the teachers ultimately who must create a conducive classroom ambience to teach 
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them for the ESL learners. Most importantly, teachers should first of all implant the confidence among 
these learners and can teach pronunciation using various creative and fun activities in the classroom. 
However, lack of necessary tools to teach proper pronunciation is a major missing in classrooms, in 
most cases. Essential infrastructure to improve the pronunciation skills are the need of the hour for many 
of the ESL learners. Hence, teachers can use the tasks and methods that can be adopted in a learning 
environment to the ESL learners so that they learn the art of pronunciation. 
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readability formulas; textbooks; The paper presents the results of a corpus study based on lexical and textual 
Kerala educational system linguistic features. The study is aimed at computing readability with the 


Flesch Reading Ease formula and Coh-Metrix L2 Readability formula. The 
corpus compiled of the Kerala State Council of Educational Research and 
Training (SCERT) 9" grade English part 1 and part 2 textbooks were 
analysed using two tools; Coh-Metrix and Text Inspector. The computed 
readability measure and vocabulary profile helped assign the Common 
European Framework Reference (CEFR) level to the text (Al- beginner to 
C2- proficient). Significant research has been carried out to map textbooks 
and international tests onto the CEFR level. The CEFR thus helps to provide 
a common basis for language textbooks, curriculum, and examinations. From 
the corpus analysis, the CEFR level of the Kerala SCERT 9" grade English 
textbooks was estimated to be between A2 and B2. Thus, this paper aims to 
provide a clear picture of the English textbook complexity and a depiction of 
its position in the educational system using the readability formulas and 
CEFR level. 


1. Introduction, Background and Scope of the Study 


In schools and other academic contexts, textbooks are a primary source of knowledge and content. They 
are important vehicles written for the purpose of teaching and learning to promote the vision of the 
curriculum. According to Stray (1993), textbooks are “designed to offer a pedagogical and didactic 
presentation of a certain field of knowledge” (Lebrun, Lenoir, Laforest, Larose, Roy, Spallanzani, 
Pearson, 2002). The National Council of Educational Research and Training (NCERT) 1970 document 
defines the textbook as a systematic, prepared, summarized material that considers learners' needs and 
interests (Yekkar, 1993). From prior research, it is evident that several features of textbooks significantly 
impact teaching and learning; content, design, structuring, and sequencing of text and exercises (Byram, 
2000, Lebrun et al., 2002). This has led to textbook research becoming an indispensable part of the 
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teaching-learning process. One approach in textbook research is to examine the readability of the 
textbook (Chavkin, 1997; Wissing, Blignaut and van den Berg, 2016; Wray and Janan, 2013) on account 
of the correlation of readability and comprehension. 


In reading science, lexical and textual linguistic features play a significant role in comprehension in 
addition to the reader, his reading ability, interest, and motivation. Mc Laughin focuses on this 
interaction between the text and its readers, their reading skill, prior knowledge, and motivation (Dubay, 
2004). Comprehension of text is thus bound by the degree of these features. The more difficult and 
complex these features are, the more difficult it is to process the text. Though the reader, his reading 
ability, interest, and motivation play a significant role in reading, the book is a primary component. 
Readability refers to the level of difficulty that affects a person’s comprehension or the ease with which 
a person can understand a written text. According to Dale and Chall (1949), readability is the “sum total 
(including all the interactions) of all those elements within a given piece of printed material that affect 
the success a group of readers has with it. The success is the extent to which they understand it, read it 
at an optimal speed, and find it interesting” (Dale and Chall, 1949; DuBay, 2004). 


Though the concept of readability has been of interest for centuries, researchers like Klare, 1963; 
Chall, 1988; Chall, 1947; Lively and Pressey, 1923; Fry, 2002; Dubay, 2004; Dalecki, Lasora and Lewis, 
2009 (Bailin and Grafstein, 2016) have acknowledged L.A. Sherman, 1893 and Thorndike, 1921 as the 
earliest researchers to use statistical analyses of text in measuring reading difficulty. The 1920s thus 
marked the beginning of incorporating readability as a factor in the field of education. Edgar Dale and 
Jeanne Chall point out that early researches carried out in readability were by educators who had to find 
some means of selecting books for different grades, “putting the right book into the hands of the right 
reader” (Dale and Chall, 1949). Though readability was considered a vital topic in the education 
literature between 1980 and 1995, the subsequent years (1995-2010) saw a decline in the interest in 
readability research. However, with the decline in students' ability to read complex text (Wray and Janan, 
2013), textbook complexity has re-emerged as a significant issue in education literature. 


The applicability of research in readability is not just limited to the grading of school textbooks 
(educational sector) (Perekeme and Aghor, 2012; Rohmatillah, 2015; Rahmawati, 2014; Luong, Nguyen 
and Dinh, 2017; Xia, Kochmar and Briscoe, 2019) but is applicable in medical text/ prescriptions 
(Wallace, Keenum, Roskos, Blake, Colwell, and Weiss, 2007; Pires, Vigario, and Cavaco, 2016; Pires, 
Cavco, and Vigario, 2017), law (Waltl and Matthes, 2014; Bourcier and Mazzega, 2007; Katz and 
Bommarito, 2013) and software readability (Buse and Weimer, 2010; Kanter, Muscarello, and Ralston, 
2008; Posnett, Hindle, and Devanbu, 2011). Yet, despite the wide application of readability, research on 
readability aspects from India is mainly limited to the field of medicine (information pamphlets and 
consent forms) and editorials (Kumar, Konidena, and Varadarajulu, 2013; Raj, Sharma, Singh, and Goel, 
2016). Presently, considerable research is also being carried out in Bengali language and Hindi since the 
work of Das and Roychoudhury in 2000 and Agnihotri and Khanna in 1991 (Phani, Lahiri, and Biswas, 
2018; Sinha, Sharma, Dasgupta, and Basu, 2012). However, Indian readability research is, to date, 
partial to literary text and general documents. As evidenced, readability research on English textbooks 
in India has not been given much significance. This article emphasizes the necessity of such readability 
research for English education in India as language teaching and learning revolves around textbooks. 
The article thus examines the 9™ grade English textbooks in Kerala (a state in India) to compute 
readability measures to provide a clear picture of the textbook complexity and subsequently depict the 
position of the textbook in the educational system. In doing so, the researcher hopes to fill the research 
gap that will provide administrators and educators additional measures in selecting and grading 
textbooks for language teaching and learning. The textbook is also calibrated against the Common 
European Framework Reference (CEFR) for languages providing an additional CEFR level benchmark 
(A1- beginner to C2- proficient) to obtain a common basis for language textbooks and examinations. 
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The calibrated CEFR level would allow educators and course developers to improve teaching materials. 
This research contributes to the first step in such an attempt; finding out the readability of the English 
textbook. The second step will be to determine if this computed readability is consistent with the 
intended readers’ level, which is outside the scope of this research. 

The paper is organized as follows. The following section presents a review of the literature and 
focuses on the existing approaches in computing readability. Section 3 introduces the aims of the study. 
The methods and computational tools employed in the study are discussed in section 4. Section 5 
presents the results and provides a discussion on the same. The last section concludes with the main 
findings and directions for future research work. 


2. Literature Review 


Fundamental research in readability has posited that there are three broad aspects of readability. 
According to Gray and Leary, they are; subject or content matter, the style of expression used on the 
content matter, and the format and organization of the content matter (Dale and Chall, 1949). Dale and 
Chall (1949) categories the three different aspects into: typography (“style of type, the spacing between 
lines, the length of lines, the size of type, the width of margins, color of paper, etc., which make for 
optimal speed and ease of reading”), content and its effect on reading (“the subject matter, the themes 
and the elements in reading materials that appeal to different groups of readers”), and factors of style of 
expression (“What kind of vocabulary, sentence structure and other expressional elements best suit the 
abilities of particular groups of readers?”) (Dale and Chall, 1949). It is on this third aspect that the 
concept of readability is defined as “comprehensibility’. Subsequent research used vocabulary load and 
sentence structure as measures to measure comprehensibility. Present research in computing readability 
has come a long way with over 200 different formulas to measure readability. 


2.1 Traditional readability formulas 


One well-known, highly regarded readability formula is the Flesch Reading Ease formula which 
indicates the reading ease of a text. The formula devised by Rudolf Flesch in the early 1940s 
used two variables to compute the readability: average sentence length in words and average 
word length in syllables. The Flesch Reading Ease formula is: 
206.835 — 1.015 ( total words ) RAG (exe 
total sentences total words 

The computed score is between 0 and 100 and can be interpreted as 100 being extremely 
easy and 0 being unmistakably difficult to read. This classic readability measure takes into 
consideration the word length, sentence length, and syllables. This is based on the positive 
correlation between ease of reading and shorter phrases or words (Kitson, 1921; Flesch, 1948). 
Table 1 posits the style description and estimated reading grade of the Flesch Reading Ease 
score given by William DuBay (2004) (DuBay, 2004; Tabatabaei and Bagheri, 2013). 


Table 1. The Flesch Reading Ease score (DuBay, 2004) 


Reading score Style description Estimated Reading Grade (U.S.) 
0 to 30 Very difficult College graduate 
30 to 40 Difficult 13 -16" grade 
50 to 60 Fairly difficult 10% — 12" grade 
60 to 70 Standard 8" and 9" grade 


70 to 80 Fairly easy 7" orade 
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80 to 90 Easy 6" grade 
90 to 100 Very easy 5" grade 


Another traditional readability formula is the Dale-Chall formula that focuses on vocabulary 
knowledge as a factor that affects vocabulary. Dale and Chall considered vocabulary to be the most 
important factor, followed by sentence length (Dale and Chall, 1949). For this purpose, they used the 
Dale expanded word list of 3000 words understood by 80% of fourth-graders to measure vocabulary 
difficulty. The Dale-Chall readability formula is: 


dif ficult words words 


words 


0.1579 ( 100) + 0.0496 ( 


If the percentage of difficult words exceeded 5% then 3.6365 should be added. This computed score 
would be the adjusted score. A lower score denotes the text is easy to read while a higher score meant 
the text is difficult to read. The premise lies on the fact that readers find it easier to read when the words 
are familiar making processing and recalling familiar. These traditional readability measures are based 
on the surface characteristics of the text like sentence length, word length and vocabulary. It does not 
take into consideration the deeper levels of text processing. Specifically, in the case of the Dale-Chall 
formula, the readability measure depends on a fixed list of 3000 words which is problematic as 
vocabulary undergoes change and is not common for all speakers of a language. To counter the 
limitations of the traditional readability formulas, the focus of research shifted to computational tools 
that would consider language-specific features. 


2.2 Coh-Metrix L2 Readability formula 


The Coh-Metrix L2 Readability score predicts the readability of texts for second language readers. The 
readability measure for L2 text needs to account for L2 reader-based variables. Accordingly, readability 
formulas for L2 readers need to be sensitive to aspects like type, function, word frequency, and word 
redundancy in the text. “The Coh-Metrix L2 Readability formula relates more to simple text complexity 
evaluation tools whose aim is to describe a text in accordance with its syntax, vocabulary, morphology, 
semantic, etc.” (Kiselnikov, Vakhitova, and Kazymova, 2020). The Coh-Metrix L2 Readability score 
involves three variables: content word overlap, syntactic similarity, and lexical frequency. The Coh- 
Metrix L2 Readability score formula is: 


- 45.032 + (52.230 x Content Word Overlap- CRFCW01) + (61.306 x Sentence Syntax Similarity- 
SYNSTRUTa) + (22.205 x Lexical Frequency-WRDFRQmc) 


The content word overlap considers the proportion of content words that overlap between pairs of 
sentences or adjacent sentences. Sentence syntax similarity refers to the sentence-to-sentence syntax 
similarity computed by measuring the uniformity and consistency of syntactic constructions present in 
the text. Lexical frequency is the logarithm frequency of content words in the text (Crossley et al., 2008; 
Bailin and Grafstein, 2016; Graesser et al., 2004). This automated readability formula reports on these 
variables to select appropriate and authentic text. In the Coh-Metrix readability formula, the higher the 
indices, the more readable the text is. 


2.3 Common European Framework Reference (CEFR) 


CEFR published in 2001, focuses on second/foreign language teaching and learning in a plurilingual 
and multilingual context. CEFR provides a thorough insight into what second/ foreign language learners 
need to communicate using the language. CEFR “provides a transparent, coherent and comprehensive 
basis for the elaboration of language syllabuses and curriculum guidelines, the design of teaching and 
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learning materials” (Council of Europe, 2001). It is a common reference toolkit, a common basis for 
language textbooks, curriculum, and examinations across Europe and other countries. With the help of 
six-level descriptors, a learner’s ability to use a language is categorized into three main groups; Basic 
users (Al & A2), Independent users (B1 & B2), and Proficient users (C1 & C2). Since the introduction 
of CEFR in 2001, it has come to dominate discussions on L2 curricula, L2 teaching and learning, and 
assessment of L2 proficiency. 


In 2017, Collocated Representative Samples of Language Competences were developed for young 
learners aged 11- 15. The new descriptors and plus level scales were drawn from the European 
Languages Portfolios and other sources (Council of Europe, 2018). The introduction of these extended 
illustrative descriptors with respect to the plurilingual and pluricultural competence is based on “the 
perception of what is reasonable/ possible rather than ‘expected’ or ‘necessary’ in terms of cognitive 
and/or social development” (CoE, 2018). In the new extended set of illustrative descriptors, several 
‘overall’ descriptors present in the original 2001 CEFR at Cl and C2 levels are not displayed as ‘they 
encompass a degree of cognitive and social maturity’ that is not relevant to the age group. Instead, they 
contain ‘can-do’ statements that seem reasonably relevant to the 11-15 age group (mentioned in Table 
9). 


3. Research Objectives 


The study's main objective is to obtain a clear picture of the written texts in the Kerala SCERT 9™ grade 
English textbooks. For this, the researcher employed the use of readability formulas and the English 
Vocabulary Profile. The English Vocabulary Profile is a reference that contains information about the 
different words, phrases, and idioms used at each level of teaching and learning. The written texts were 
also aligned with CEFR levels to decipher the level of the textbook with reference to the common 
framework. Such documentation would provide a roadmap of options for educators. The results obtained 
could be incorporated for future studies to compare against the learners” readability level to find out 
whether the written text aligns with the learner's reading level or ability. 


Research steps taken in this study were: 
(1) Estimating the readability of the text using the Flesch Reading Ease formula. 


(2) Comparing the obtained Flesch Reading score against the style description and estimated 
reading grade posited by DuBay (2004). 


(3) Estimating the readability of the text using the Coh-Metrix L2 readability score. 
(4) Analyzing the text using the English Vocabulary Profile from the Cambridge University Press. 
(5) Aligning CEFR level to the written text. 

Aims of the study involved: 


1) Analyzing the readability of the written texts in 9" grade Kerala SCERT English textbook using 
yzing 2 gr 
two readability formulas, Flesch Reading Ease and Coh-Metrix L2 Readability. 


(2) Examining whether the order of written texts in the 9™ grade Kerala SCERT English textbook 
accord with the readability of the text. 


(3) Obtaining a clear understanding of the level of the textbook with reference to the common 
framework (CEFR) would allow for an estimation of its difficulty level beyond the state and 
country (CoE, 2001; CoE, 2018). 


(4) Creating a database for future studies on grading the secondary level Kerala SCERT English 
textbooks and estimating learners’ readability level. 
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4. Method 


The research employed a descriptive qualitative approach to analyze the corpus. The content analysis 
approach was chosen as the study only involved the written data in the textbook. For the material of the 
study, the researcher read the 9" grade Kerala SCERT English textbook to select passages for analysis. 
Eight passages were selected after carefully identifying the texts taught to the students. The selected 
passages (T1, T2, T3, T4, T5, T6, T7, and T8) were narrative, descriptive, factual, or anecdotal. Five 
Poems and one play were removed from analysis on account of poetic devices and repetitive 
construction. Out of the eight passages, four were by Indian authors, two by American authors, one by 
a Greek author, and one by a German author. Of the eight passages, seven passages were adapted text 
except for one passage that was by an Indian author. The different analysis of the corpus were carried 
out using the various tools discussed below. 


4.1 Coh-Metrix 


Coh-Metrix 3.0 is an online computational tool developed by Arthur C. Graesser and Danielle S. 
McNamara (Cohmetrix, 2017). It was developed as a project in the field of Learning under the Institute 
for Intelligent Systems, University of Memphis. The tool considers language properties beyond the 
simple and shallow metrics involved in traditional formulas; sentence, word length, and vocabulary. 
Coh-Metrix makes use of the language and discourse components that exist in the text for analysis. Coh- 
Metrix was made possible with the advances in disciplines like psycholinguistics, corpus linguistics, 
computational linguistics, discourse processing, information extraction, and retrieval (Crossley, 
Greenfield, and McNamara, 2008). According to Graesser, McNamara, Louwerse, and Cai (2004). Coh- 
Metrix “analyses texts on over 200 measures of cohesion, language, and readability. Its modules use 
lexicons, parts-of-speech classifiers, syntactic parsers, templates, corpora, latent semantic analysis, and 
other components that are widely used in computational linguistics” (Graesser et al., 2004). It considers 
deeper levels of text processing: decoding, syntactic parsing, and meaning construction (Crossley et al., 
2008). Coh-Metrix thus reports on detailed language and cohesion features. Cohesion is an objective 
property of the text which reflects the degree of logical connectiveness between the sentences. Coh- 
Metrix examines the test using linguistic features that are related to cohesion. The different indices 
reported by Coh-Metrix are grouped into 11 sections: 


e Descriptive 

e Text Easability Principle Component scores 
e Referential Cohesion 

e LSA 

e = Lexical Diversity 

e Connectives 

e Situation Model 

e Syntactic Complexity 

e Syntactic Pattern Density 

e Word Information and 


e Readability 
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These sections (comprising 106 indices), readability reports Flesch Reading Ease, Flesch-Kincaid 
Grade Level, and Coh-Metrix L2 Readability score. For the study, the Flesch Reading Ease and Coh- 
Metrix L2 Readability score were computed from the analysis done with Coh-Metrix. 


4.2 Text Inspector 


The CEFR level of the selected passages was computed using the Text Inspector tool (Text Inspector, 
2020). This online language analysis tool was created by Stephen Bax (Professor of Applied Linguistics) 
supported by academic partners, the Centre for Research in English Language Learning and Assessment 
(CRELLA). Text Inspector provides the option of selecting the mode of text that is to be analyzed from 
three modes of text; writing, reading, and listening. For this study, the passages were analyzed using 
reading as the mode of text with an additional option to exclude all numbers from the analysis. Text 
Inspector provides detailed information regarding readability (Flesch Reading Ease), complexity, lexical 
diversity, estimated CEFR level, English Vocabulary Profile (EVP), British National Corpus (BNC), 
Corpus of Contemporary American English (COCA), Academic word lists (AWL) and phrases and parts 
of speech tagger. The English Vocabulary Profile tool in the Text Inspector was developed with the help 
of the Cambridge University Press and Cambridge English Language Assessment as part of the English 
Profile Project. This tool uses the Cambridge Learner Corpus (CLC) to help analyze texts in terms of 
their CEFR level (A1-C2) by classifying words by levels according to their meaning. 


The CEFR level estimated in Text Inspector is based on the Overall Lexical Profile of the reading 
text. The Lexical Profile of the reading text is calculated based on the following metrics: 


e Statistics (sentence, word and syllable count, type/token ratio, etc.) 

e Lexical Diversity (MTLD- Measure of Textual Lexical Diversity and vocd-D) (Duran, Malvern, 
Richards and Chipere, 2004; McCarthy and Jarvis, 2010) 

e Lexical sophistication: EVP (including % of words at each level) 

e Lexical sophistication: BNC 

e Lexical Sophistication: COCA 

e Lexical Sophistication: AWL (Coxhead, 2000) 

e Metadiscourse markers (linguistic features that add extra information to a text; 13 categories of 
metadiscourse markers, Hyland, 2004; Bax, Nakatsuhara and Waller, 2019). 


Based on these metrics that were attributed CEFR levels, the overall analysis provided an estimated 
CEFR level of the passages mentioned in Table 7. In CEFR (2001), the 3 level groups (A (basic user), 
B (independent user), and C (proficient user)) with 6 level scales (A1-C2) are as follows: 


e A1- Beginner (Breakthrough) 

e A2- Elementary (Waystage) 

e B1- Intermediate (Threshold) 

e B2- Upper Intermediate (Vantage) 

e Cl- Advanced (Effective Operational Proficiency) 
e (C2- Proficiency (Proficiency) 


However, the Text Inspector further subdivides these levels into four levels in each level as 
mentioned below following the revision of the original descriptors in 2008 to include plus levels: 


e Al] - Beginner 
e Al+- Beginner 


e A2 - Beginner 


Pappachan / 2021 47 


e A2+ - Beginner 

e BI - Independent 

e Bl+ - Independent 

e B2-—- Upper Intermediate 





e B2+ - Upper Intermediate 
e Cl - Advanced 

e Cl+ - Advanced 

e C2 - Proficiency 





e (2+ - Proficiency. 
e D1 -— Academic (University- undergraduates) 


e D2 - Academic (University- postgraduates and above) 


5. Results and Discussion 


This study carried out the statistical analysis of computing readability of the passages using the Flesch 
Reading Ease readability formula in Coh-Metrix. The Coh-Metrix L2 Readability score was also 
computed using the same computational tool. The third and fourth step in the study was conducted 
through the EVP and Lexical Profile analysis via Text Inspector. 


Table 2 shows the Flesch Reading Ease computed for the eight passages in the corpus. 


Table 2. Flesch Reading Ease score of passages 


Passages Tl T2 T3 T4 TS T6 T7 T8 
Flesch 82.378 72.651 76.840 67.018 78.108 86.557 84.630 78.885 
Reading Ease 


Table 3 compares the computed Flesch Reading Ease against the description provided by William 
DuBay (2004) as discussed above. The higher the Flesch Reading score, the easier the text is to read. 


Table 3. Flesch Reading Ease score with reference to DuBay’s description (DuBay, 2004) 


Passages Flesch Reading Ease score Style description 
Tl 82.37 Easy (80-90) 

T2 72.65 Fairly easy (70-80) 
T3 76.84 Fairly easy (70-80) 
T4 67.01 Standard (60-70) 
TS 78.10 Fairly easy (70-80) 
T6 86.55 Easy (80-90) 

T7 84.63 Easy (80-90) 


T8 78.88 Fairly easy (70-80) 
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Based on Table 2 and Table 3, it can be seen that the passages are not ordered with respect to a 
standard readability procedure. If this standard readability procedure is followed, the passages’ logical 
rank ordering is illustrated in Figure 1. should be as mentioned in Table 4. For example, T6 should be 
given the logical rank order T1; that is, the text's sixth passage should logically be presented as the first 
passage in the text. The original first passage T1, should be presented as the third passage in the text. 


Figure 1. Clustered column chart of the logical rank order of passages in standard IX corpus based 
on Flesch Reading Ease score 


70 
60 
40 
30 
20 
10 

0 

T6 T7 T1 T8 15 T3 T2 T4 


Logical order of passages 


Flesch Reading Ease score 
wo 
oO 


™@ Flesch Reading Ease score 


It can be seen that of the eight passages, the fourth passage has the lowest readability score, the 
highest being 86.55 and the lowest being 67.01. Interestingly, the only passage that seemed to be 
attributed ‘standard’ readability and estimated at ‘8", 9" grade is the original rendition of the extract 
from Mohandas Karamchand Gandhi's autobiography, “The Story of my Experiments with Truth’. All 
the rest of the passages are adapted versions of the original text. When the researcher enquired about the 
adaptation process, it was told that no significant process was carried out. 


The Coh-Metrix L2 Readability score of the passages is as provided in Table 4. 


Table 4. Coh-Metrix L2 Readability score of passages 


“Passages = Coh-Metrix L2 Readability 
score 
TI 20.359 
T2 21.263 
T3 15.436 
T4 20.624 
TS 16.237 
T6 18.186 
T7 15.860 


T8 18.276 
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With the understanding that the different readability score will be accorded in the same range, it was 
surprising to note that the passage that had the lowest Flesch Reading Ease score, 67.01 (making it 
highly difficult) was attributed to be the second easiest passage to be understood with the Coh-Metrix 
L2 Readability score of 20.62 (Table 4). On close examination, it was observed that the difference in 
the readability was because of the high cohesion between sentences in the text (T4- 0.107) in contrast 
with the referential cohesion of other passages. The Content Word Overlap variable (in the Coh-Metrix 
L2 Readability formula) discusses the referential cohesion in the text. This shows that there is a higher 
degree of logical connectiveness between the sentences, unlike the other adapted passages in the text. 
This might be because of the fact that this passage was an extract from the original text and not an 
adaptation. Also, the easiest passage as per the Coh-Metrix L2 Readability score was found to have the 
highest referential cohesion- 0.131. This made the passage (T2) that had second to least Flesch 
Readability Ease score being computed as the easiest passage in the corpus. These two passages (T2 and 
T4) also had a high average sentence length and a high average syllables per sentence when compared 
against the same in the other passages. 


Figure 2. Clustered column chart of the logical rank order of passages in standard [X corpus based on 
the Coh-Metrix L2 Readability score 
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With respect to the second aim of the study, simply computing Flesch Reading Ease does not help in 
predicting readability. Though sentence length certainly plays a role in readability, a high cohesion text 
will yield better readability than a low cohesion text with a lesser average sentence length and average 
syllable per word. In this sense, the logical rank order of the passages based on the Coh-Metrix L2 
Readability score is mentioned in Figure 2. 


From the analysis using Text Inspector, the estimated CEFR level of the passages is shown in Table 


Table 5. Estimated CEFR level using Text Inspector 
“Passage  CEFRIevel Level description 
Tl A2+ Beginner 
T2 Bl+ Independent 
T3 B2 Upper Intermediate 
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T4 Bl+ Independent 

TS B2 Upper Intermediate 
T6 A2+ Beginner 

T7 A2+ Beginner 

T8 Bl+ Independent 





From the estimated CEFR level, it can be seen that only two passages, T3 and T5, were estimated at 
CEFR B2- Upper Intermediate, and three passages, T2, T4, and T8, were estimated at CEFR level B1+- 
Independent. A more detailed analysis observed that passages T3 and T5 had a higher percentage of 
words that belonged to the BNC and COCA corpus, lending to a higher CEFR level in the lexical 
sophistication metric of the Lexical Profile calculation. 


With respect to the research step of analyzing text using EVP, it was seen that the percentage of 
words (types) at each CEFR level, Al, B2, and Cl, did not follow any significant logical order. The 
percentage of words (types) at each level is mentioned in Table 6. 


Table 6. EVP: % of word (types) at Al, B2, and C1 levels 


“Passages ——-% of word (types) at Al__—‘% of word (types) at B2__-% of word (types) at C1 
Th 53.420 90 

T2 50.86 6.49 1.95 

T3 46.43 8.28 2.32 

T4 50.79 6.46 3.58 

T5 44.95 8.89 - 

T6 54.77 5.34 1.70 

A) 53.10 6.57 1.40 

T8 48.82 7.50 1.51 


Based on the results shown in Table 6, the fourth passage had a high concentration of words that 
belonged to the higher level of C1. During the analysis, it was seen that from the passages, an average 
of 10.90 % of words (types) went unlisted when the passages were analyzed according to the EVP. So, 
the researcher went through the analysis of the text using the BNC and the COCA. This analysis 
observed that the % of words (types) that went unlisted were mainly names (people and places), 
numbers, and certain phrases that did not have a reference in EVP. The analysis also revealed that there 
was a spelling error with the word “sockes’ in passage eight. 


Of all the passages, only the seventh passage belonged to a descriptive text type. The passage was 
computed to have a high Flesch Readability score and a low Coh-Metrix L2 Readability score. 
Accordingly, it is an easy text with respect to the former, and with respect to the latter, it is a difficult 
one. The estimated CEFR from Lexical Profile places it at A2+ (Beginner). This is because of the lack 
of cohesion in the text as the text type is descriptive and has a low average sentence length and low 
average syllables per sentence. However, the Metadiscourse metric of Lexical Profile places it at C1+ 
on account of the high metadiscourse markers other than logical connectives. The other Metadiscource 
markers consist of textual metadiscourse markers- sequencing, label stages, announce goals, topics shifts 
as frame markers; code glosses; endophoric markers; evidentials and interpersonal metadiscourse 
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markers- attitude markers, hedges, relational markers, person markers and emphatics (Bax et al., 2019; 
Hyland, 2004). 


The Lexical Profile uses only a limited number of measures to estimate CEFR levels. This estimation 
is limited as it does not take into account other aspects of grammatical complexity and context of use. 
For a more definite calibration of CEFR level, it is necessary to analyze what the different passages 
require the learners to do as per the curriculum directives. This would allow for a clear-cut, distinct 
estimation of CEFR level. The illustrative descriptors in the form of “can-do” statements of CEFR level 
provided in the “Descriptors for young learners aged 11 to 15° is given in Table 7. These illustrators 
outline the descriptors for overall reading comprehension in “written reception’ (CoE, 2018). 


Table 7. Illustrative descriptors ‘can-do’ statements for different CEFR levels for ages 11-15 


A2 Can understand short, simple messages and texts containing basic vocabulary relating to areas of 
personal relevance or interest. 


A2+ — Can understand short, simple texts on familiar matters of a concrete type which consist of high 
frequency everyday or school related language 


Bl Can understand texts of varying lengths in which words are used which are familiar to me from my 
daily context or because they concern my main area of interest. 

B2 Can read with a large degree of independence, adapting style and speed of reading to different texts 
and purposes. Have a broad active reading vocabulary but may experience some difficulty with low- 
frequency idioms. 


Cl Can understand in detail lengthy, complex texts whether or not they relate to his/her own areas of 
speciality, provided he/she can read different sections. 


C2 Can understand virtually all forms of written language including abstract, structurally complex, or 
highly colloquial literary and non-literary writing. 


From the 2018 extended list of CEFR levels for ages 11 to 15, C1, Cl+, C2, and C2+ are mentioned 
as levels that will be relevant to the higher end of the age group. Considering the fact that a typical 
student would be between 14-16 at the 9" grade, it seems alarming to note that not a single passage in 
the text could be estimated at CEFR levels B2+ and above. Suppose, on further analysis, the learner’s 
language competence is B2 (as mentioned in Table 7), then it seems counterintuitive to provide a text 
aligned with CEFR score B2 and lower. This defeats the central purpose in the teaching-learning process 
that depends on motivating and challenging the student. 


6. Conclusions 


The data reported in the result and discussion provides the computed readability score using the Flesch 
Readability Ease formula and Coh-Metrix L2 Readability formula. It also discusses the logical ordering 
of the passages in the text and presents CEFR levels. 


Analyzing the results so far, the Lexical Profile analysis estimating the CEFR level of the text can 
be seen as an adequate representation of the level of the text. As per CEFR (2001), the different CEFR 
levels provide reliable grading of reading text. From a review of Ivanka Natova (2019) study, the 9"- 
grade equivalent CEFR can be observed to be between B2 and C1 (Natova, 2019). With this in mind, a 
reading text of B2, B2+ and C1 would prove to form an adequate text for 9 graders. However, as the 
levels of development and literacy vary depending on social, national and educational context, further 
research and detailed, in-depth analysis need to be carried out to map out the learners’ proficiency level 
and readability level when they enter the 9" grade and subsequent study into their interest, motivation 
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and background knowledge. This study thus provides a preliminary groundwork for further studies. The 
data collected on the readability formulas will help grade other Kerala SCERT English textbooks at the 
High School level. Through research on learner’s end-grade competence, the CEFR level can be plotted 
with the help of its ‘can-do’ descriptors. This would let educators and researchers to align text according 
to the learner’s CEFR level. In this regard, the first task of estimating the CEFR scores of the textbook 
is performed through this study. 
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relations, abstract concepts, As we all know children undergo the complex language acquisition process 

concrete concepts, Malayalam seemingly in an effortless manner and even acquire the ability to comprehend 

speaking community, mind, a number of complex concepts at a very young age itself. Psycholinguistics 

generativity. is the branch of the study that explores the intricacies involved in this 
complex process. The current study focuses on the psycholinguistic aspects 
underlying the comprehension of complex concepts in semantics such as 
sense relations, abstract concepts and concrete concepts. It has attempted to 
bring out the involvement of the mind through its generativity capacity. The 
highlight of this study is that it is conducted in a comparatively less explored 
Malayalam language speaking community belonging to the Indian state of 
Kerala. Malayalam is an official language recognized by the Indian 
constitution that has the classical status. The data collection was a continuous 
process and it was collected from six informants belonging of age below 
three. 

1. Introduction 


Among the many complex phenomena in the world, language acquisition occupies a 
high position based on its complexity. But surprisingly in children this process occurs in a 
seemingly effortless manner even with the comprehension of complex concepts at a very young 
age itself. Psycholinguistics, the study that centers around the relationship between mind and 
language tries to find out the intricacies involved in the complex language acquisition process. 
Psycholinguistics uses language and speech as a window to the nature and structure of human 
mind (Scovel: 1998). There are four domains of psycholinguistics: acquisition, comprehension, 
production and dissolution, these domains are linked to each other in a chain process. 
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Acquisition — Comprehension — Production— Dissolution 


Acquisition explains how the child acquires his first or the native language (mother 
tongue). Comprehension explains how children understand sounds, words and sentence in a 
language. Production of language is the manifestation of its acquisition and comprehension. 
Dissolution means how language loss happens. 


The present study concentrates upon the psycholinguistic aspects that assists the 
comprehension of complex concepts in semantics. The study has tried to unearth the significant 
role of human mind in the concept development process by exhibiting its generativity capacity. 


The study is carried out among the native speakers of Malayalam language (native 
language of the south-Indian state of Kerala). Malayalam is comparatively a less explored 
language in the field of language acquisition research. The data collection process was a 
demanding one, mainly because the informants were of the age between | to 3 years. 


1.1, Literature review 


The acquisition of meaning starts until the infant begins to acquire words. For later 
developments of meaning need roots in earlier developments. This is true for the growth of the 
awareness of meaning. The pre linguistic developments of word acquisition emphasize two 
major goals 


a) To develop the notion of ‘word’ 
b) To acquire the meanings of the words in the native language. 


The former development precedes the latter. The infant arrives at the realization that 
vocalizations may be meaningful. Here the linguistic sign is the result of the Signifier- signified 
relation where the signifier is some group of sounds and the signified is the concept these sounds 
stand for or represent. For example, the Malayalam word ‘maram’ is a phonetic string [maram] 
paired with the concept MARAM that we recall when we hear these sounds. 


The linguistic sign has two important properties. One is the relation is arbitrary. There is nothing 
about the signified that requires a particular signifier. The English sound [tri] for ‘maram’. The 
English sounds also represent the concept. Second thing is the concepts are abstract 
representations. Some of the concepts can be quite abstract, e.g. Love. The young child, then, 
needs to develop the signified-signifier relation, abstract concepts, and the arbitrary nature of 
the relation. 


Piaget (1952) argued that the signifier-signified relation of the linguistic sign has its roots in 
more primitive relations of signifiers and signified of the sensorimotor period. The signifier 
signified relation goes through from birth to the time when the linguistic sign is developed. 
Arbitrary signs do not occur until the end of the sensorimotor period. It is important to realize 
that the signifier-signified relation can apply to sensory impressions. To use one of Piaget’s 
examples, we may observe a tree while sitting in a park. We may not see all of the tree and 
certainly we do not observe it from all angles or perspectives, yet we know that we are watching 
a tree. The signified, then, is TREE, the intellectual concept, not the sensory attributes. The 
latter are the signifiers that allow us to recognize the concept that represents the object. 


According to Imai (2005) Nouns should be learned more easily in early stages of lexical 
development “because concepts denoted by nouns are cognitively more coherent and accessible 
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than concepts denoted by verbs. Verbs may not be learned as easily as nouns due to following 
reasons: First, actions can be more difficult to encode and to remember than objects as the 
concepts of verbs typically denote. Actions are less tangible perceptually than concrete physical 
entities. 


Sometimes children try to over extent the meaning of a word. Marcus (1992) state that ‘The 
phenomenon of overgeneralization itself is not in doubt, nor is the creative nature of the 
psychological processes that cause it’. 


1.2. Research question 


The present study tries to find out how the generativity capacity of the minds of children 
under the age of three helps them in the complex conceptual development process occurring at 
the semantic level of language acquisition. It also explores the psycholinguistic aspect 
underlying the complex conceptual development process and the generativity capacity that 
works behind this process. 


2. Method 


The study is a descriptive and quantitative one which comes under the developmental 
framework. There for it is executed as a longitudinal one. 


2.1. Sample / Participants 


Data was collected from six informants belong to the Malayalam speaking community 
and were of the age from 1 to 3 years. The researcher has maintained an equal ratio of male and 
female distribution while selecting the informants. There were three female and three male 
informants. The investigator also has attempted to bring inclusiveness in the selection of 
informants. Therefore, children have been selected from Hindu, Christian and Muslim religion. 
To maintain an equality in the economic background of the informants the investigator has 
selected all the six informants from middle class family. 


2.2. Data collection procedures and data analysis 


The data collection method of the present study followed the direct and participatory 
observation (involving parents and caretakers) method. The investigators designed a schedule 
to conduct the data collection in an effective and organized manner. According to the schedule 
constant observation of the informant was conducted, dairies were maintained for the six 
informants in which parents and care takers of the children could update entries and the 
investigators regularly reconfirm the observation entered in the diaries. The collected data was 
analysed and described at the semantic level. 


3. Analysis 


The study of Semantics deals how linguistic elements carry meaning. As Language is a 
symbolic system each symbolic elements or words are linked to specific concepts. Eventhough 
the process of concept development is complex children acquire this stage of development 
extremely rapidly and simply. 


In order to acquire the semantics of a language, a child must 1. identify the relevant linguistic 
items 2. identify or understand the meanings linked to these items and 3. learn how these forms 
connect to their respective meanings. There is a relation between language and cognition. 
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Concepts help a child to understand about direction, location, position, number, quantity, 
sequence, attributes, dimension, size, similarities and differences. In order to function in a 
society, one must learn the rules and structures of the language system. Structure of language 
helps a child to become more specific in its understanding, and it's use provides the knowledge 
of concepts. There are two types of concepts: abstract and concrete concepts. Abstract concept 
is the one which cannot be learnt through definition; e.g., spatio-temporal concept whereas the 
concrete concept is the one that can be learnt through definition. The linguistic system conveys 
meaning and the conceptual system interprets it. This study mainly concentrates on concept 
development in children. 


The semantic analysis identified that from babbling stage the kinships are identified even with 
overgeneralisation. Thus, the first identified grammatical categories are the nouns. The data of 
noun classification identified before the age of 3 in children has attained almost all noun 
categories. The complex categories of abstract nouns such as ‘sahayam’ ( meaning help) are 
also possible for the child’s mind to acquire before the age of 3. Thus, child’s mind is capable 
to acquire the linguistic information about nouns at this stage. This was supported by the 
observed data which reveals that, during this time the vocabulary developed by the child 
includes question words, spatial and temporal concept denoting terms, negative words, 
directional terms etc. This shows that the number of nouns are greater in number than verbs. In 
two-word stage children become experts in vocabulary development. In multi word stage they 
try to use complicated words with complex concepts. The data reveals that children show a 
tendency to use antonymous words for the actual words they intend to use eg. /patukke/’ softly’ 
for /urakke/ ‘loudly’. From the data it is evident that the child depends upon antonymous terms 
because of the shortage of word in the mental lexicon that helps them to reveal the idea that 
they want to communicate but here the generativity element is evident because the child uses 
the antonymous terms of the concept or idea. 


Gradually child starts to use most of the synonymous words for simple vocabulary. For eg, 
/kataka/ ‘door’ the child use /jennel/ ‘window’, For niRaye /ottiri child use /valiyatu/‘bigger’. 
Here the child co-relates the physical appearance of things with its mental lexicon to produce 
the terms signifying the things around. This can be seen as a generativity element because the 
child finds some similarity in the physical appearance of the things that helps them to build a 
co-relation with the concept and appropriate terms in the mental lexicon. 


There is also hyponymous words which are identified in child’s vocabulary. They are very few 
in number. When children assign a meaning to an unfamiliar word form, they must take into 
account different kinds of semantic relations. They give information about a relation like class 
inclusion in utterances. e.g. /migangal / ‘animals’, ‘This is a kind of animal’, car/bus> vandi 
‘vehicle’. This shows that the child’s mental lexicon has the concept of hyponymy. Therefore 
the child uses hyponymous terms in suitable contexts. 


Children’s initial hypotheses about word meanings arise from the conjunction of social and 
conceptual knowledge. The people and the environment around the child helps it to think in the 
right direction and arrive at the correct meaning. e.g. the noun /sahayam/ ‘help’ is conceptually 
abstract. For eg, when the cartoon ‘Dora-bujji’ is broadcasted in the television. It attracts a large 
number of children of different age groups. This cartoon is presented in such a way that the 
viewers also get an opportunity to interact with the main character of the cartoon. This is 
obviously an exciting element for children. The child only after getting the correct meaning of 
the word attends to store it in the mental lexicon. This is the reason behind minimal error in the 
usage of language by children. We can also find that there is no overgeneralization except in 
the early stages after the use of kinship terms. 
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/enikko palla tekkanam enna orma/ ‘I want to brush my teeth, it’s my memory’ From the data 
collected it is evident that the child wants to convey that he felt like brushing now but here, he 
has used a noun ‘memory’ instead of the actual verb felt. This example points out that the child 
has depended upon the noun for expressing abstract concept instead of verb. Similarly, in 
another instance the child tends to use noun ‘good’ instead of the verb ‘convulsed’. 

Gradually the child starts to use exact verbs to express the abstract concepts but we can observe 
the presence of overregularization too. From the data it is observed that instead of /kaduva cattu/ 
‘tiger died. The child tends to use /kaduva kolli/ ‘Tiger killed’. 


Finally, we see that the child has attained the level of using the right verb at the right place. It 
is evident from the data when the child says /pona patta veccu/ ‘The phone is kept hiding’ 
instead of saying /pona kanate poi/ ‘The phone 1s missing’. Similarly, in an instance the child 
uses the sentence / dOttarpuppan tinnu poi / ‘Doctor grandfather exhausted’ instead of doctor 
grandfather has already moved away from the front of its house. There is a third instance from 
the data showing the child using the verb ’switch off ‘instead of ‘remove’ the child tells /cticca 
Oppo ceyya/ ‘Switch off the shoes. 


/kai muRinnu > /kai potti poyi/ ‘hand broken’ (There is an injury in the hand). 


From the collected data it can be observed that children tend to co-relate the knowledge of 
different grammatical categories of Malayalam before they produce it through there speech. It 
takes a little more time for the child to reach the level of adult’s usage of language. 

From the data obtained at the month of 19 of a child, it becomes evident that the child has no 
idea about the size of an object. e.g. when a broken candle hangs from its thread the child calls 
it /ana/ ‘elephant’. The moving part of the object is conceptualised by the child as the trunk of 
an elephant because here the child has not noticed the size of the elephant. Kellman (2013) point 
out that the ability to detect motion-carried information is fundamental to the development of 
perceptual system. 


By the child attains an age of 33 months he becomes aware of the size of the object. 


eg. when the child was provided with small pieces of potato, he asked for bigger one. He used 
exactly these words /itintRe ammaye ta/- give the mother of this. Here the child co-relates with 
the size difference and psychological attachment he has with his mother because at this stage 
the child’s world is his mother. This is also evident from the above given data. 


From the data obtained the sentence /kattilinte adilo ratiri/ ‘under the coat it is night’ the term 
/ratri/ relates to the darkness of the night. The child refers to the darkness under the coat to the 
darkness of the night or lack of light during the night time. This happens because the child has 
experienced darkness during the night, therefore the child tends to relate his experience of 
darkness with the darkness under the coat. 


It has been observed that the children tend to grasp concepts relating to space much quickly 
than concepts relating to time. This might be because the social environment of the child tends 
to develop an urge in him to grasp the spatial and directional concepts and use it in their 
communication to fullfill their needs. When it comes to temporal concepts, we can only find 
three terms before the age of three in children. These terms refer to the concept of night, 
tomorrow (child tends to use it in every situation relating to temporal concepts). It might be 
because adults tend to use these few terms more often to the children through baby talk and 
motherese. 
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The concept of death is very complex for the child to understand before the age of three. 
Child do not understand the meaning difference between /kolluka/’kill’ and /cakuka/ ‘died’ 
from the data /kaduva kolli/ (tiger killed) for /kaduva cattu/ ‘tiger died’. Here the child does 
not understand the difference between killing and dying. 


The death of dove, the child understands that something had happened to the dove since it 
remains immobile and lye on the ground. When the child saw a ball deflated, the very next day 
he says, the ball also died. Similarly, he relates the immobility and lying of the broomstick on 
the floor to the death of the broomstick. The child also relate the dipping of the piece of biscuit 
and drowning of the same into the milk to the death of the biscuit from the observed data /ctila 
cattupoyi/ ‘broomstick died’ and /bikkattu munni munni cattu/ ‘chappathi ‘dipped,dipped and 
died’ In first case, the child tend to relate the death of the dove and the deflation of the ball on 
the basis of living and non-living concepts. But in second case the child considers the non- 
functionality of the broomstick to the concept of death. And finally, in third case, the drowning 
and disappearance of the piece of biscuit tends the child to relate it to the concept of death. 


The child’s concept of death is changing. Babies do not have the cognitive capability to 
understand an abstract concept like death before two. After two years child understand the non- 
functionality concept in death. While a small child will not be able to intellectually understand 
what death is, he or she will certainly notice the absence of a significant person. Wells (1995) 
argued that children initially approach death with curiosity, whereas emotions of stress, fear 
and sorrow appear later and were adopted as behavioural patterns by the environment. They 
develop an intense curiosity as to what happens to the human body, while at the same time they 
consider that dead people are in a state of protracted sleep, but generally not different to that of 
the living. 


4. Conclusion 


Analysing the data, collected from the semantic level of language, reveals that children exhibits 
the capacity of generativity through their performances. Even though all the six children acquire 
language in an identical pattern, there is difference in the period of acquisition within them. 


The capacity of generativity in children is identified from the overgeneralizations observed. The 
data obtained from the production level of acquisition and comprehension stages highlights this 
overgeneralization tendency. Though overregularization seems to be awkward in the use of 
language, in children it is considered as a manifestation of creativity. This throws light on 
Chomsky’s innate grammar, that proposes universal grammar which suggests that children are 
born with innate linguistic knowledge that guides the language acquisition process. The child 
achieves basic mastery at an early age and become capable of producing an infinite number of 
sentences, which is called language creativity. 


The capacity of generativity identified in children is actually the manifestation of the mental 
processes involved in minds of the children. According to the analysis conducted at the 
semantic levels of linguistics one can observe the generativity element which is actually the 
result of the underlying mental process. 
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Government schools, A recent report of “Annual status of Education Report” has revealed that only 
U.P. Board schools, English 36.2 per cent students in grade V in U.P board government schools are able 
as a second language (ESL), to read textbooks of grade II (Annual Status of Education Report (2018), 
language skills, 2019).One of the obvious and primary reasons for students of U.P. board not 
Communicative Competence, been able to read and specifically speak in English fluently and intelligibly 
Pedagogical approaches. is that there has been little or no emphasis on speaking and reading skills. 


Unlike most of the English medium private schools they are not encouraged 
to participate in classrooms where they can use second language for 
discussions. Hence, they remain ignorant of the situational and natural use of 
English as a second language. Today, a special emphasis is given to the 
learner centred approaches of language teaching in English medium schools 
where learners are free to participate and discuss in language classrooms. 
Consequently, students do not struggle to communicate with ease in English. 
The present study tried to find out the linguistic and communicative 
competence of ESL learners of U.P board schools. The sample consisted of 
78 students of U.P. board schools of district Aligarh. The data was collected 
through a semi-structured interview schedule which were done individually 
as well as in group. Besides, the researcher also took a test of some of the 
skills (LSRW) to assess the linguistic competence of the learners. The results 
obtained revealed that the emphasis on Grammar translation method hampers 
the communicative competence, speaking skills, and pronunciation of 
students. Absence of extra-academic activities is also found to be a major 
reason behind little or no personality development of a large number of 
students. 


1. Introduction 


With a remarkable improvement in the proportion of literates over the decades, i.e. from 18.33 
per cent in 1951 to 74.04 per cent in 2011 (census, 2011), India has undoubtedly made a 
significant growth in educational attainment. However, had some states such as Uttar Pradesh 
been better at literacy rates, the numbers would have been more encouraging. While in states 
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like Kerala and Mizoram the literacy rates are more than 90 per cent, Uttar Pradesh on the other 
hand could score only 67.68 per cent (census, 2011) which is far lesser than the top figures. The 
recent report of “Annual status of Education Report” has also revealed that only 36.2 per cent 
students in grade V in U.P board government schools are able to read textbooks of grade II 
while the per cent for the same in Kerala is 73.1(Annual Status of Education Report (2018), 
2019). Researchers believe that it is the foundational weakness of children caused due to poor 
or no primary education which leads to illiteracy among young population as primary education 
is the first step towards establishing fundamental literacy and numeracy skill in children 
(Chowdhury, 1995; Hwang, 2017; Roser and Ortiz-Ospina, 2020). Moreover, primary or 
elementary school age is also a brilliant time to learn a second language. The critical period 
hypothesis first put forward by Penfield and Roberts (1959) and later popularized by Eric H. 
Lenneberg (1967) in his book “Biological Foundations of Language” states that the period from 
early childhood to adolescence is the critical period of language acquisition (Penfield and 
Roberts, 1959; Lenneberg, 1967). Therefore, it is one of the primary concerns of the 
government and the concerned policy makers to provide quality of elementary education as 
well as basic proficiency in English to every child specifically in states like Uttar Pradesh in 
order to prepare a skilled future workforce. 

Although it is pretty obvious that being the most populated (also in terms of children), 
Hindi dominant state of India, providing English medium schooling to every child is a bigger 
challenge for the U.P. government; yet there have been conscious efforts from the state as well 
as the central government to transform schooling into learning by controlling and co- 
coordinating the impart of basic education in schools other than high schools or intermediate 
colleges. In this regard, the Uttar Pradesh Basic Education Act, 1972 was passed by the Uttar 
Pradesh Government as an urgent need to improve the literacy in the state (U.P. Act No. 34 of 
1972). Prior to 1972, District Boards and municipalities including municipal corporations etc. 
used to run the primary and upper primary schools in Uttar Pradesh. 

Besides, following the high demand of English medium schools from stakeholders, 
Minister of state for basic education, Anupama Jaiswal had announced that Uttar Pradesh would 
have nearly 5,000 state government-run English medium primary schools in the coming years 
(Jain, 2017), which have been functioning since April, 2018. 





Figure I shows an English medium UP board Primary School in Lucknow. Image source: 
Hindustan Times 
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However, in spite of all these efforts it has been observed that there is hardly any change at the 
ground level (Mullick, 2018) as there has not been any new appointment of trained English 
teachers (Bhatia, 2019). Therefore besides providing modern education to all children, there is 
also an urgent need to train the teachers and to critically analyze the pedagogical approach 
adopted in U.P board schools specifically while teaching English as a second language to yield 
the desirable results from students. 
1.1 U.P Board Government Schools: A Critical Overview 

Government schools across the state have been found to be inferior in terms of quality 
education and sanity as little or no priority is given to improve the overall standard of these 
schools. Consequently, people who could afford the sky high fees prefer to send their children 
in private schools which offer much better services. Therefore, to equalize the educational 
standards of the government schools with the private ones, it is vital to identify and bridge the 
gaps responsible for substandard educational services in these schools. Following are some 
major systematic flaws in government schools across the country specifically in Uttar Pradesh 
which impede learning environment and affect the educational development of children. 


(i) Lack of Trained Teachers 

It is the most prominent and a long pending issue which has been brought into light by 
media reports time and again for several years. Nearly a decade ago, it was reported in India 
Today magazine that there is a shortage of 1.40 lakh teachers at the primary level in U.P. 
(Yadav, 2011), and as of 2018 there was still a shortage of 97000 primary teachers in the state. 
These numbers were quoted by the present C.M itself, who also stresses that government is 
concerned about the recruitment of teachers on the basis of merit but the failing education 
system in state hasn’t been able to produce trained and qualified teachers (“Not enough qualified 
teachers”, 2018; “Shortage of 97000 primary teachers”, 2018). Surveys also revealed that 
about 16% of the total schools in the state are single-teacher schools, i.e. schools run by only 
one teacher (Mullick, 2016; Raghavan, 2017). 

To fill this shortfall and minimize the teacher pupil ratio up to the norms, the state 
government recruits Shiksha Mitras (para teachers). However, most of the Para teachers are 
also untrained and do not undergo any special training after they are recruited. Lack of trained 
teachers affects students’ learning outcomes and they achieve low results in the board exams. 
The data of the UP board exams, 2019 shows that 139 schools in the high school category and 
249 in the intermediate category registered below 20% pass result (Mullick, 2019). Centre for 
Budget and Governance Accountability in its study found that not even 1% of the school 
education budget is spent on teachers’ training, which is a major reason for their incompetency 
in teaching (Kundu, 2018). 


(ii) Poor Infrastructure and Unwholesome Mid-Day Meal 

Proper infrastructure and nutrition form essential requirements of a good learning 
environment for children. As far as infrastructural facilities are concerned, many private, 
schools today have smart classrooms which are equipped with various teaching and learning 
aids like Overhead Projector, Interactive Whiteboard, Wireless Microphone, Speakers etc. 
While majority of government run schools across India do not have above discussed basic 
facilities, several U.P board schools are running without basic infrastructural facilities like 
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furniture, electricity, drinking water, and even building. According to the official data of the 
state government, there are 113249 primary schools in the state out of which 81 schools have 
no building, 547 schools do not have washroom for boys, 474 schools do not have washroom 
facility for girls, 1542 schools are running out of water facility, 60221 schools do not have 
electricity, and 53284 schools have no furniture for students. 

Similarly, Out of 45590 upper primary schools, 20 schools have no building, 334 
schools have no washroom for boys, 151 schools do not have washroom facility for girls, 1379 
schools are running out of water facility, 23314 schools are running without electricity service, 
15346 schools have no furniture for students (As per District U-DISE DATA 2016-17). This 
creaking infrastructure hampers students’ learning interests and has turned out to be a major 
factor behind children not being admitted to government and government aided schools 
(Gohain, 2018). Besides, mid day meal served to the students has also often been complained 
to be unhygienic. There has been frequent news regarding contaminated food being served to 
children in government schools. Recent reports claim that Uttar Pradesh tops in food 
adulteration with most number of convictions (4,590) in cases related to food adulteration 
followed by Tamil Nadu (1,311) and Madhya Pradesh (878) between financial year 2017 and 
financial year 2019 (Sharma, 2019). 


(iii) High Teacher-Pupil Ratio 

Right to Education Act (2009) recommends the presence of two teachers for up to 60 
students in class first to class fifth, and at least one teacher for every 35 students in class sixth 
to eighth. In other words, the ratio of 30:1 is to be followed in primary classes, and in upper 
primary classes the pupil teacher ratio should be 35:1. The majority of government primary 
schools in U.P, however, do not seem to be complying with the RTE norms. Surveys claim that 
these schools have classrooms overcrowded with students (Dutta and Bala, 2012; Only 15% 
UP schools meet Pupil—-teacher ratio, 2013; Kohli, 2016; Uttar Pradesh has India’s largest 
population, 2017; India improves student-classroom ratio, 2018; Significant decline school 
numbers, 2018; student-classroom and Pupil teacher ratio, 2018). 

While the average pupil-teacher ratio in the country is comparatively better, it is the 
downsizing staff in the U.P board schools, as discussed above, which is a major reason behind 
student teacher ratio being high. Scarcity of teachers also compels some teachers to teach 
multiple subjects and students of different sections are forced to sit together. Massive 
population of students in classrooms makes it difficult for a teacher to connect with the students 
and make individualized focus. Students too do not participate in learning which deteriorates 
the overall classroom environment. 


1.2. Literature review: Pedagogical Approach in Teaching English in U.P. Board Schools 


English is taught as a compulsory subject from primary classes across India besides being used 
as an official language in government proceedings alongside Hindi. A synthesis report of Sarva 
Siksha Abhiyan says that there are two major objectives of teaching English at primary level; 
(i) to make students attain basic proficiency in English as they acquire in natural language 
learning, and (ii) to develop the language into an instrument for knowledge acquisition. In U.P 
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board schools, oral English is introduced in class I and written English in class III (Dutta and 
Bala, 2012), which later becomes optional from class IX onwards (Choubey & Verma, 2013). 


In spite of being given the teaching of English as a compulsory subject from early school age, 
students of U.P. board schools struggle to communicate well in English as the students of 
private schools do. This is largely because of the outdated approach with which English is 
taught in government schools. But, it is not only the pedagogical approach of teaching English 
that is responsible for making students confident and fluent in communication in English, there 
are several other factors as well that are responsible for making one’s career and overall 
personality development. 


However, the above discussed issues are relevant not only valid for the primary and upper 
primary schools of U.P. rather for the schools of this entire country. The following points draw 
a comparison between government run schools and English medium private schools in terms 
of pedagogical approaches adopted in teaching English as a second language as well as some 


other factors that play major role in shaping students’ career. 


Government Schools Private Schools 


Follow traditional Grammar translation 
method of teaching English as a second 
language in which students are made to 
memorize the grammatical rules and give little 
or no attention to speaking and listening. 
Students’ native language is used as the 
medium of instruction in classroom. 


Teachers depend too much on textbooks and 
do not move beyond it. For example, reading a 
poem line by line rather than describing the 
intent of the poet and consequently fail to 
make students feel and appreciate its content. 

Students just copy the text written on the board 
and rarely interact with teachers or ask any 
question. In other words, learners are passive 
in the class where teacher centered approach is 
followed. 

Very few or no extra-curricular and social 
activities are conducted. Consequently, in 
spite of receiving regular academic teaching 
students remain stagnant and could not add 
anything to their personality development and 
skill enhancement. 


Keeping pace with the demands of the current 
market, give more emphasis to spoken English 
and adopt communicative approach in which 
students are encouraged to speak in English as 
much as possible. 

Target language (English) is being employed 
to carryout conversation to inculcate English 
in the daily life of students. 


Teachers use textbooks only as a tool and 
explore the concept by themselves to make 
sure that students conceptual 
understanding of the contents being taught. 


acquire 


Most of the students take part in the learning 


process by interacting frequently with their 
teachers to clear doubts on topics being taught 
to them. 


Ex-curricular and personality development 
activities like skit, extempore, quiz, essay 
writing etc. are organized frequently which 
contribute much in shaping — students’ 
personality and soft skills. 
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Students generally belong to lower strata of | Most of the students come from above middle 
society with weak educational as well as | and elite class with parents and guardians also 
economical background of their parents and | being educated and aware of the current 
guardians. Hence, students do not get proper | educational needs of their children. Thus, 





guidance in choosing best career options. their mushrooming starts with early age. 

Rare or no field trips and industry visit. Students are often taken for field trips to 
make them explore the world outside the 
classroom. 

In frequent tests are conducted resulting to Tests and exams are conducted regularly to 

improper assessment of students’ academic track the progress and learning outcomes of 

progress. students. 





Having discussed the above mentioned difference between the public and private schools it is 
in no means meant that government schools do not provide quality education at all, and/or 
students of these schools cannot have a bright career like their private schools counterparts do, 
nor can we say that education being imparted in all private schools meet the required standards. 
It is in most of the cases, the above made bifurcation finds its justification. With the 
mushrooming of low budget private English medium schools, English medium schooling has 
become easily accessible for many underprivileged children coming from economical weaker 
section. But this accessibility hardly seems to be providing the quality of education as per the 
English medium standards. Reports suggest that most of the teachers teaching in such schools 
are recruited in low salaries and are inadequately trained. Hence, except teaching students 
through prescribed books, they do not have innovative ideas to contribute to the overall 
personality development of students. Consequently, despite studying in English medium 
schools, students fall short of acquiring communicative skills and other basic life skills (Private 
Schools not providing, 2017; Private School Teachers Need More Training, 2017) 


1.3. Research questions 

1. What are the views of students of U.P. board schools regarding learning English as a 
second language? 

2. What are the views of students of U.P. board schools regarding infrastructural facilities 
in their schools? 

3. What are the views of students of U.P. board schools regarding teaching pedagogy 
followed by their English teachers? 

4. What are the views of students of U.P. board schools regarding quality of assessments 
taken in their schools? 


2. Method 


The present study is based on qualitative research which aims at analyzing the pedagogical 
approach adopted in U.P. board schools. 
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2.1. Sample / Participants 


Out of total 1756 primary schools and 734 upper primary schools of Aligarh district (As per 
District U-DISE DATA 2016-17), a sample of 78 students of Upper primary level from 5 
different Upper primary schools was selected through purposive sampling technique which 
constitute population of the study. 


2.2 Instrument(s) 


The instrument for the purpose of data collection for the present research work comprises of a 
series of semi-structured interviews. 


2.3 Data collection and analysis 


The researchers developed a semi-structured interview schedule for the purpose of study. The 
interviews were done individually as well as in group. The questions were closed — ended in 
which they needed to reply with either Yes or No. The responses were recorded and later 
analyzed. 


3. Results 


The summary of the responses to each question is discussed below 


Do your teachers check your work | 56 22 
in your note book regularly? 

Does your English teacher teach | 7 a 
English through translation? 














2 
Does your’ English _ teacher | 67 11 
encourage you to write answers in 
your own words? 


Zaki and Khan / 2021 69 





Do your teachers encourage you 
to speak in English 





Is grammar given more 
importance in your school than 
functional English? 





Do your teachers ask questions 
after they teach you? 





Does your English teacher explain 
the stories and poems in English? 





Does your English teacher talk to 
you in English? 





Do your teachers other than 
language teachers (such as 
science, social science, 
mathematics,) talk to you in 
English 





Is Grammar given more 
importance in your school than 
functional English 





Analysis of the research data 


Some major takeaways from the primary data are discussed below: 


e Out of the total 78 respondents 68 (87%) revealed that Grammar is given more importance than 
functional English in their schools. 
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e 67 respondents (86%) said that English teachers rarely communicate with them in English. She/he 
speaks English only while reading English texts. 

e 75 respondents (96%) replied that their teachers taught them English through word for word 
translation. 


e Only 8% of the total respondents said that their English teachers explain them stories and poems in 
English while others said that their teachers use native language to explain the poems and stories. 


e However, 71 respondents (91%) revealed that teachers encourage them to speak in English. 


The researchers besides the interview also took a test of some of the skills of English to assess 
the linguistic competence of the students. The test comprised of all the four language skills of 
English i.e. listening, speaking, reading and writing (LSRW). In writing section many students 
made spelling mistakes but found to be better at Grammar. More than 90 percent of the students 
struggled to express themselves in oral English. They also made pronunciation mistakes. In the 
reading section, the students were asked to read a paragraph in English. Out of 78 students, 
hardly 15 students could read the paragraph with ease whereas others were struggling a lot. 
However most of the students were found to be better at listening. 52 students were able to 
respond immediately after they had been asked questions in English while for others the 
researchers had to repeat the same question in their mother tongue i.e. Hindi or Urdu. Hence, 
on the basis of test and the primary data, the researcher has accepted all the three hypotheses. 


4. Suggestive Measures 


The detailed analysis of the related literature and news reports on U.P. board schools, and the 
primary data received from U.P. board students have enlightened the researcher towards certain 
measures that are needed to be implemented to better the pedagogical practices specially while 
teaching English as a second language as well as the overall standard of U.P. board schools. 


e Besides the traditional methods of teaching English as second language such as Grammar 
translation method (which is most used in government schools of UP as revealed in the 
primary data), Audio-Lingual method, Direct method (which is mostly followed to develop 
oral fluency in English) etc; language teachers and trainers are strongly advised to review 
following language teaching methods suggested by pedagogists, which are believed to 
include multi-lingual practices more effectively. These methods include; 

(i) Translanguaging- A method in which multilingual speakers are allowed to use their 
native languages as an integrated communication system. 

(ii) Cooperative language learning (CLL) - A group of learning activity organized so 
that learning is dependent on the socially structured exchange of information 
between learners in groups and in which each learner is held accountable for his or 
her own learning and is motivated to increase the learning of others (Olsen and 
Kagan, 1992:8). One of the prime features of this method is that it gives students the 
experience they need for healthy social, psychological, and cognitive development. 

(iii) | Content Based Instruction (CBI) - In this method, teaching is organized around 
the content or information that students will acquire, rather than around a linguistic 
or other type of syllabus. 
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(iv) Task Based Language Teaching (TBLT) - It is based on the use of task as the core 
unit of planning and instruction in language teaching. The basic elements of this 
method are purposeful activities and tasks that emphasize communication and 
meaning. 

e There is an urgent need to aware people about the importance of education and bringing 
children to schools. A report published in a leading newspaper reveals that there has been 
3.2 lakh less enrolment in U.P. board schools in the session-2018-19 as compared to the 
session 2017-18 (Lavania, 2018). In spite of conducting a door-to-door survey and 
convincing parents in slum and other areas to send their wards to schools under the 
campaign “School chalo Abhiyan”, the reduced enrollment of children in schools has 
definitely raised certain questions on the success of such schemes. Thus, government should 
also keep a check whether its schemes are implemented on ground properly. 

e There is a need of trained teachers who can adopt new approaches of teaching English as a 
second language. 

e The learner centered teaching environment should be implemented so that learners can 
freely use the target language in the classroom. 

e Extra-curricular activities should be conducted frequently so as to imbibe some soft skills 
specially communication skills, team management, basic manners, and work ethics. 

e It is a general consensus among people especially in U.P that government schools cannot 
provide latest curriculum to their children. So those who can afford the private schools can 
leave no stone unturned for providing education to their child but those who belong to below 
middle class are compelled to put their children in government schools. Therefore, the 
timely revision in the overall curriculum, specially the syllabus, the central part of 
curriculum, is the need of the hour. 

e [tis a known fact that the command of English language is the most desired skill in today’s 
world. To keep pace with the globalization, we must prepare our children to compete with 
the most advanced global generation and for this English is the basic required source. Thus, 
English teachers should follow communicative approach to teaching English as a second 
language in which the focus is more on meaning and less on form. 

e Students should be encouraged to speak in English as much as possible even if they make 
grammatical errors and should not be interrupted during their speech. However, it does not 
mean to sideline the accuracy completely, it is just a focus on one of the skills (Brumfit, 
1984). 

e Teachers can work on their accuracy, in some other class, focus on fluency and accuracy 
should be invested separately. 


5. Discussion and Conclusion 


The present study was aimed at examining (1) the pedagogical approaches followed by English 
teachers of U.P board primary schools, and (ii) the linguistic as well as communicative 
competence of students of U.P board schools. Besides, the study attempted to explore the 
curriculum of these schools to find out gap between U.P board and private schools of Aligarh. 
The major takeaways from the responses highlight that the Grammar translation method is still 
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being followed in the U.P. board schools in which there is little or no effort made to train 
students in the spoken or communicative English. Majority of students accepts that their 
teachers encourage them to speak in English however, 90 percent of the total respondents, as 
mentioned in the above data revealed that only English teacher communicate with them in 
English that too while teaching whereas other teachers communicate with them in students’ 
mother tongue. This is one of the most prominent factors behind students of the U.P. board not 
getting enough exposure of communicative English even in the schools. Therefore, it has 
become an urgent duty of the government as well as the concerned authorities of the U.P. board 
schools to make sure that students receive quality education. If we are able to make the above 
mention modifications, we will definitely contribute towards making students of U.P. board 
schools proficient in English and raising the overall academic standard. 
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taste, cognitive semantics, Taste is one of the five basic human senses with which we perceive the 
conceptual metaphors, flavour of the food that we consume. The adjectives used to describe the 
prototypically, polysemy perception of flavours or tastes are called taste-terms or taste descriptive 


adjectives. The perception of the taste is generally different in different 
languages. The present paper aims at the analysis of the perception of taste- 
terms in Gujarati. Unlike English, there are seven main tastes in Gujarati, 
represented by seven different taste-terms. In this paper, the primary focus is 
on conceptualising the taste-terms with reference to its prototypical domain, 
i.e. the usage of taste-terms to describe various flavours of the food. 
However, among these seven taste-terms, all are not limited to their 
prototypical usage. In Gujarati, there are a few taste-terms such as sweet, 
spicy, salty, bitter, etc. which have an extended meaning, when used 
metaphorically. e.g. /mit*i dgib'/ ‘sweet tongue’ or /kodui dsib'/ ‘bitter 
tongue’, conveying the positive attribute of politeness and negative attribute 
of impoliteness, respectively. Hence, the analysis of the extended meanings 
of the taste-terms derived from the metaphors present in the language, and 
their conceptualisation in various domains is examined in this paper. These 
extended meanings are derived from the conceptual metaphors available in 
the language, where the source domain is a taste descriptive adjective while 
the target domain vary. The target domains are established based on the 
metaphors observed in the conversational use of the language. There hasn’t 
been much work done in this area, and especially, in Indian languages. This 
paper is divided into four sections. The first section consists of an 
introduction. The second section deals with the conceptualisation of the 
taste-terms with reference to the cognitive model of prototypicality. In this 
section, the human perception of taste is described, contributing with an 
insight about how the taste-terms are used prototypically. This section also 
discusses the Cognitive Semantic analysis of the extended meanings, derived 
from the conceptual metaphors, of all the taste-terms available in the Gujarati 
language. The third section examines the polysemic behaviour of the taste- 
terms. In closing, the fourth section provides the conclusion. 


1. Introduction 


Taste terms describe the taste or flavour of the food. In Gujarati, there are seven basic taste terms while 
in English, there are five, namely, Sweet, Bitter, Sour, Salty and Umami. Among these five basic taste 
terms, Gujarati shares four taste terms with English. These taste terms are /mif"u/, /gol|ju/ and /mod*us/ 
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for ‘sweet’, /koduu/ ‘bitter’, /k*atu/ ‘sour’ and /k*aru/ ‘salty’. The perception of these taste terms is 
common in both languages. However, three taste terms are limited to Gujarati, and its perception in 
English is exceptionally different than in Gujarati. These taste terms are — /tik"u/ ‘spicy’, /tucu/ 
‘astringent’ and /molu/ ‘tasteless’. In English, /tik"u/ is conceptualised as ‘pungent’ which can be 
described as a taste that yields a sharply strong sensation. Ginger is described as pungent in taste. But, 
in Gujarati, /tik"u/ is conceptualised as “spicy”, i.e., a hot taste that yields a burning sensation in the 
mouth. Spicy, as a taste descriptive adjective, usually describes food with many spices and in western 
countries the use of spices is minimal. Hence, spicy is not considered as one of the main taste terms. 
While in India, spices are commonly used in the food, making ‘spicy’, as one of the main taste terms in 
the majority of the Indian languages. Hence, in this paper, we will use the taste term ‘spicy’ instead of 
pungent. Spicy, as a taste descriptive adjective, can be used for the food having chilly, black pepper, 
garlic, etc. Astringent /turu/, as a taste, can be described with an example of alum and green or raw 
banana. The last taste is a taste term for conceptualising tasteless food. /mo|u/ in Gujarati is used to 
describe food that doesn’t have any taste as such. Salt is a key ingredient in Gujarati cuisine and the 
food which have less or no salt in it, is perceived as tasteless. The taste terms described above are further 
analysed in the present paper. Note that in the Gujarati language, all the taste terms agree with the gender 
of the noun they modify. 


This paper is divided into two main sections. In the first section, the prototypical usage of taste terms is 
discussed. In this section, the taste descriptive adjectives are analysed based on two categories of food, 
viz. natural and manmade. Natural food items incorporate food items that we consume in their natural 
form e.g., fruits, nuts, honey, etc while manmade food items include food that is cooked or processed 
by us. The second section deals with the cognitive-semantic analysis of the extended meanings of taste 
terms. The metaphorical usage of taste terms contributes to extended meanings. The metaphors analysed 
in this paper are the ones that are often used in day-to-day conversations. In the cognitive linguistic 
view, metaphor is defined as understanding one conceptual domain in terms of another conceptual 
domain. A convenient shorthand way of capturing this view of metaphor is the following: 
CONCEPTUAL DOMAIN A IS CONCEPTUAL DOMAIN B, which is what is called a conceptual 
metaphor (Kovecses, 2010, p.04). The use of small capital letters indicates that the particular wording 
does not occur in language as such, but it underlies conceptually all the metaphorical expressions listed 
underneath it (Kovecses, 2010, p.04). A conceptual metaphor consists of two conceptual domains, in 
which one domain is understood in terms of another (Kovecses, 2010, p.04). The two domains that 
participate in conceptual metaphor have special names. The conceptual domain from which we draw 
metaphorical expressions to understand another conceptual domain is called the SOURCE domain, 
while the conceptual domain that is understood this way is the TARGET domain (Kovecses, 2010, p.0 
4). In this paper, the SOURCE domain is the taste descriptive adjective or the taste term while the 
TARGET domain is the noun that is described by the taste term. These metaphors are grouped based on 
semantic regularity observed in the TARGET domain. If the TARGET domain comprises nature, 
language, temper, laughter, etc. of an individual, then the metaphors representing these traits are grouped 
to form a single metaphor - PERSONALITY IS TASTE. In this paper, six such metaphors are discussed, 
namely, PERSONALITY IS TASTE, EXPERIENCE IS TASTE, ACTION IS TASTE, RELATION IS 
TASTE, MUSIC IS TASTE and BELIEF IS TASTE. All these metaphors comprise various TARGET 
domains which are analysed in detail. The third section examines the metaphorical extension in terms 
of the semantically polysemous behaviour of the taste terms. 


The motivation behind analysing the taste terms is to study how we conceptualize the taste terms with 
respect to other domains. It also provides an insight into the importance of taste as a sense and its impact 
on our perception of the world around us. 
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2. Analysis of Taste Terms 
2.1. Prototypical Usage of Taste Terms 


In this section, the taste terms are analysed based on their prototypical usage. The prototypical usage of 
taste terms is analysed on the basis of their application for describing natural and manmade food items. 


Gujarati people are usually considered to be fond of sweets and their love for sweets is also reflected in 
their language. The Gujarati language has three taste terms for ‘sweet’ which are - /mif"u/, /mod*ur/ and 
/golju/. 


The taste term /kaduu/ ‘bitter’ is used for naturally bitter food items such as fenugreek, cocoa beans, 
coffee beans, aloe vera, bitter-gourd and leaves of a neem tree. For example, /kadvu ktwarpat"ti/ “bitter 
aloe vera’, /kodvu karelu/ ‘bitter bitter-gourd’, etc. While for manmade food, coffee, dark chocolates, 
cooked bitter-melon/bitter-gourd and fenugreek leaves, as well as items that contain fenugreek leaves 
or fenugreek seeds in higher proportion are described by taste terms /koduu/ ‘bitter’ e.g., /kodui kofi/ 
‘bitter coffee’. /k"aftu/ ‘sour’ is used for natural food items such as tamarind, raw mango, gooseberry, 
orange, lemon, lime, mandarin, etc. e.g., /k"afi keri/ “sour mango’, /k"afi amli/ “sour tamarind’, and so 
on. Under manmade food items, fermented foods, yoghurt or curd, cooked raw mango, orange juice, 
lemonade, etc. are described as /k*atu/ ‘sour’. The taste term /k"aru/ ‘salty’ is used to describe natural 
salts such as edible rock salt e.g., /k"aru mit*u/ ‘salty salt’, while for manmade food items, any cooked 
or processed food item which has more salt in it, is described as /k®aru/ ‘salty’. However, if the 
proportion of salt is less or the salt is lacking in any manmade food item, then that food item will be 
described by the taste term /mo|u/ ‘tasteless’. Natural food items like oatmeal and fruit like water 
chestnut are described as /molu/ ‘tasteless’. /ik"u/ ‘spicy’ is used to describe natural food items such as 
chilli, pepper, garlic, radish and ginger. Any manmade food item that contains more spices, especially 
chilli, is described as /tik"u/ ‘spicy’. Hence, the usage of all the taste terms for manmade food items 
depends upon the flavour of the ingredients used in a particular manmade food item and their proportion. 
/tucu/ “astringent” is the taste term limited to the description of natural food items such as potash alum, 
green or raw bananas and ridge gourd. Moreover, the after taste of java plum and gooseberry is described 
by this taste term. 


These were the prototypical uses of the taste terms describing tastes or flavours in terms of natural and 
man-made food items. Now we will investigate the metaphorical uses of the taste terms. 


2.2. Metaphorical Meanings of Taste Terms 


The taste descriptive adjectives, as its name suggests, are commonly used to describe the taste or flavour 
of the food, as discussed in section 2.1. However, these adjectives obtain a metaphorically extended 
meaning when they are used to describe something other than the taste or flavours of the food. In this 
section, we will analyse how the metaphorical application of taste terms reflect their extended usage. 


2.2.1, PERSONALITY IS TASTE 


The first conceptual metaphor of taste - PERSONALITY IS TASTE, emerges when an individual’s 
personality is represented by taste descriptive adjectives. Personality is the set of psychological traits 
and mechanisms within the individual that are organized and relatively enduring and that influence his 
or her interactions with, and adaptations to, the intrapsychic, physical, and social environments (Larsen 
& Buss, 2008, p.04). Hence, when such traits in an individual, are described by taste terms, the cognitive 
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approach towards the personality is the same as that towards the food and its flavours. Hence, in this 
conceptual metaphor, PERSONALITY is the TARGET and TASTE is the SOURCE. 


The first personality trait we will discuss here is the — ‘nature of an individual’. Gujarati allows us to 
describe the nature of an individual by three different taste terms i.e., sweet, spicy and salty. The 
metaphorical usage of the taste term ‘sweet’ can be exemplified with the metaphor - /mit'o svob*av/ 
‘sweet nature’. Here, the taste “sweet” is related to the goodliness of an individual, conveying the 
meaning of humble, kind or good nature. Sweet is generally considered as a pleasing taste and hence it 
is related to the positive traits of an individual’s personality. Contrarily, spicy and salty tastes have 
negative traits associated with them. Hence, the metaphorical meanings of the metaphors /tik"o svab"av/ 
‘spicy nature’ and /k"aro svob*av/ ‘salty nature’ are fierce nature and harsh, spiteful or malicious nature 
respectively. “Spicy” and ‘salty’ are perceived as strong tastes as compared to ‘sweet’. They are not 
considered pleasing and hence the negative perception of the taste is reflected in the cognitive aspect of 
the language. Moreover, these three taste terms are most commonly used to describe different 
personality traits in the Gujarati language. 


The second personality trait. “language and words used by an individual’ is generally described by the 
three taste adjectives — sweet, bitter and spicy. Analogous to the first personality trait, here also, the taste 
term ‘sweet’ 1s associated with the positive attribute of the personality. This can be exemplified with the 
following metaphors: /mif"i vani/ “sweet language’; /mit'i dgib’/ “sweet tongue’; /mit*o ovads/ “sweet 
voice’; /mit'a bolu/ ‘swect speaker’. In these metaphors the following meanings are conveyed: polite 
for ‘sweet language’ and “sweet tongue’, soothing or melodious for “sweet voice’ and honey-tongued or 
sweet spoken for “sweet speaker’. Besides spicy, bitter 1s also an unpleasant taste and hence, bitter and 
spicy will relate to the negative attribute of this trait as follows: /kodui vani/ “bitter language’: /kodui 
dib’/ “bitter tongue’: /kodva fobdo/ ‘bitter words’; /tik"a bol/ “spicy words’. Here the meanings 
conveyed by bitter and spicy are — hurtful, unpleasant or unkind. Moreover, in the metaphor /mif"i c®o.i/ 
“sweet knife’, although the taste term used to describe the trait is sweet, the implication is negative. 
Analysing the gloss or the literal meaning, we can say that, even though the SOURCE, the taste 
descriptive adjective, have positive implication, but the TARGET 1.e., the knife, which is described by 
the taste descriptive adjective have negative implication. The function of the knife contributes to the 
metaphorical meaning, which is either to cut or to penetrate. A knife can wound or damage someone, 
causing them physical injury. But here, the metaphorical implication of damage or injury is related to 
an abstract concept of respect, i.e., the damage caused to the self-respect of an individual by insulting 
him/her with the use of disrespectful language leading him/her to emotional pain and injury. Hence, the 
implication of this metaphor is negative. 


Furthermore, Gujarati provides us with the liberty to use “sweet” and “spicy” taste descriptive adjectives, 
to describe the third trait i.c., “how a person looks at an individual’, as follows: /mit*i nodgor/ “sweet 
eyesight’ and /tik"i nodgor/ ‘spicy eyesight’. For this trait also, taste term sweet is associated with the 
positive trait, conveying the meaning of affection or love for “sweet evesight’, while taste term spicy is 
related to negative trait, conveying the sense of sarcasm, as in a sarcastic look for “spicy eyesight’. 


The next personality trait, i.e., “an individual’s temper’ can be described by the taste descriptive 
adjective — spicy, which can be demonstrated with these metaphors as follows: /tik"u mogods/ ‘spicy 
brain’; /tik"o midgad3/ “spicy mood’. As discussed above, spicy is usually associated with the negative 
traits of an individual’s personality like petulant or bad-tempered for “spicy brain’ and fiery temper for 
“spicy mood’. 
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Moreover, a smile which is a positive trait or attribute of human personality is described by the taste 
descriptive adjective — sweet, conveying the sense of pleasantness as in the metaphor /mif'u hasjo/ 
“sweet laughter’. 


In Gujarati, a person’s courage and passion or spirit are related to the taste of salt. When the proportion 
of salt is less than required, the food becomes tasteless and hence, we cannot enjoy its flavour. Hence, 
saltless food or food with less salt loses its flavour. Flavour usually, is considered as the spirit of the 
food. This concept is reflected in cognitive modelling of the metaphors - /mo|o manps/ ‘tasteless person’ 
and /mitha vinano (mangs)/ ‘saltless (person)’, when the taste descriptive adjectives — tasteless and 
saltless are used metaphorically to describe a person’s spirit and timidity respectively. In these 
metaphors, “tasteless person’ implies that a person is spiritless and “‘saltless (person)’ implies that a 
person is coward or timid. 


2.2.2. EXPERIENCE IS TASTE 


When various experiences that an individual can have throughout his/her life are described by taste 
descriptive adjectives, the metaphor - EXPERIENCE IS TASTE emanates. Experience can simply be 
defined as the lessons we learn or the knowledge we gain from the activities we do. Here, by activities, 
we are referring to all the daily activities along with various impromptu occurrences and events that we 
encounter during our whole life which serves as an exemplar in our life. 


Now the range of experience of which we can become aware is vast. It includes not only all the ordinary 
dimensions of human life, (perception, motion, memory, imagination, speech, everyday social 
interactions), as well as cognitive events that can be precisely defined as tasks in laboratory experiments, 
(for example, a protocol for visual attention), but also manifestations of mental life more fraught with 
meaning, (dreaming, intense emotions, social tensions, altered states of consciousness) (Depraz, Varela, 
& Vermersch, 2003, p.02). From this vast range of experiences, in this paper, we are focusing on the 
experiences gained by the ordinary dimensions of human life and experience as manifestations of mental 
life more fraught with meaning i.e., dreams, sleep (altered states of consciousness), pain (intense 
emotions) and various sensations. 


When the experience is described by a taste descriptive adjective, the cognitive approach towards the 
experience is the same as that towards food and its flavours. Thus, in this conceptual metaphor, 
EXPERIENCE is the TARGET and TASTE is the SOURCE. 


To analyse the metaphor - EXPERIENCE IS TASTE, we have categorized various experiences into 4 
semantic categories. In the first category, the TARGET holds its prototypical meaning which is of 
undergoing any kind of experience in day-to-day life. In Gujarati, experience is described 
metaphorically by two taste descriptive adjectives i.e., sweet and bitter as follows: /mit'o onub*sv/ 
‘sweet experience’; /kodvo onub"9sv/ “bitter experience’. Analogous to that of the previous metaphor, 
here also the taste descriptive adjective — sweet has a positive implication i.e., of a pleasant experience 
while the taste descriptive adjective — bitter has a negative implication 1.e., of an unpleasant experience. 


The second, third and fourth semantic category exhibits the experience as manifestations of mental life 
more fraught with meaning. In the second category, the TARGET indicates the social tension arising 
from social interaction which can be elucidated from the following example: /kodui tika/ “bitter 
criticism’. Here, the SOURCE and the TARGET are semantically complementary to each other. The 
TARGET holds a negative implication, as criticism is an expression of disapproval or the judgement of 
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someone’s failure. Moreover, the taste descriptive adjective used to describe criticism is bitter. As 
discussed in the previous metaphor, ‘bitter’ holds a negative implication in metaphorical 
conceptualization, in turn conveying the metaphorical meaning - acrid criticism. Hence, in Gujarati, this 
metaphor is used to intensify the negative implication of the TARGET i.e., Experience of Criticism. 


In the third category, the TARGET indicates various states of consciousness such as sleep and dreams 
as demonstrated in the metaphors - /mif'i t:g"/ “sweet sleep” and /mi{"u svopno/ ‘sweet dream’. In 
Gujarati, the taste term ‘sweet’ is used as the taste descriptive adjective for the states of consciousness. 
Here also the taste descriptive adjective — sweet has a positive implication of restful/balmy (sleep) and 
pleasant (dream). However, in the metaphors - /mift'u dorod/ ‘sweet pain” and /mif'i jad/ “sweet 
memory’, intense emotions of pain and nostalgia are described by the taste term — sweet. In Gujarati, 
these metaphors are generally used in the context of love. The pain that is tagged along with love is 
generally called sweet pain. There can be many possible causes of this pain. Here, we will consider the 
situation which can explain both the metaphors that describe emotions like pain as well as nostalgia as 
follows. The pain that two lovers endure when they are separated by distance can be described as sweet 
pain. When they feel lonely, they recollect the memories that they have made with each other. These 
memories make them feel nostalgic but also induce a feeling of hope for their reunion. The pain that 
they feel because of the separation and nostalgia inflict pleasure, as these emotions ensure them that 
there is someone who loves them and is waiting for them. Hence, although the implication of the 
TARGET is negative, the implication of the metaphor is positive, making the SOURCE and the 
TARGET semantically incongruent with each other. 


In the fourth and the last category, experience bears the meaning of various sensations experienced by 
the human body. In Gujarati, the sensations that can be described by the taste descriptive adjectives are 
itching sensation which can be exemplified by the metaphor - /mit'i k*3dzval/ ‘sweet itching sensation’ , 
olfaction as in the metaphor - /mit*i sug3d"/ ‘sweet smell’ and sunburn (burning sensation of skin 
because of the sun) as in the metaphor - /tik"o tap/ “spicy heat of the sun’. The pleasant smell such as 
the smell of a flower or an incense stick is generally described with the taste descriptive adjective — 
sweet in the Gujarati language as demonstrated in the metaphor - /mit"i sug3d"/. The SOURCE and the 
TARGET are semantically incongruent in the metaphor - /mi{"i R35 dzval/ . Usually, an itching sensation 
is perceived as irritating, which has a negative implication. But here, the itching sensation is described 
with the taste descriptive adjective — sweet, conveying a positive implication of pleasant itching 
sensation. In the Gujarati language, the itching sensation that provides a sense of relief after scratching 
the affected area is conceptualized as a “sweet itching sensation’. For example, when a skin abrasion is 
about to heal, we usually feel an itching sensation around the wounded skin. That itching sensation 
provides a sense of relief after scratching around the wound, which is conceptualized as a pleasant 
itching sensation. In the metaphor - /tik"o tap/, the heat of the sun that burns the skin is described by the 
taste descriptive adjective — spicy. Here, the SOURCE and the TARGET are semantically 
complementary to each other. In Gujarati, the conceptualization of the heat of the sun as spicy develops 
from the conceptualization of chilli. When we eat spicy food, usually food that contains a higher 
proportion of chillies, we feel a burning sensation in our mouth, likewise, when we are exposed to the 
heat of the sun, we feel the burning sensation on the skin. Therefore, the heat of the sun rays that burns 
the skin is conceptualized as spicy and hence the metaphor ‘spicy heat of the sun’ intensifies the 
description of the heat of the sun, conveying the sense of stinging heat. 
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2.2.3. ACTION IS TASTE 


Action can be defined as the process of doing something, typically to achieve an aim. Here, we are 
analysing three such actions 1.e., quarrel, talk and discussion which, in the Gujarati language, are 
described by two taste terms i.e., sweet and spicy. Hence, the third conceptual metaphor — ACTION IS 
TASTE appears when these actions are described by taste descriptive adjectives as demonstrated with 
the metaphors - /mit"o dg"ogdo/ ‘sweet quarrel’: /mit' vato/ ‘sweet talks” and /tik"i ffortfa/ “spicy 
discussion’. In the metaphor - /mit"o dg'agdo/, the SOURCE — sweet is semantically incongruent with 
the TARGET — quarrel. In general, the quarrel has a negative implication on the relationship of the 
people who are quarrelling. Quarrel also disturbs the peace of mind. However, the taste descriptive 
adjective used here (i.e., sweet) to describe quarrel, has a positive implication. That is because, in the 
Gujarati language, the quarrel between lovers (also known as “lovers” quarrel”) or friends or any two 
people, that eventually strengthens the bond and solves all the misunderstanding between them, leading 
to an even stronger bond is conceptualized as a positive action. So, although a quarrel has a negative 
implication, here it has a positive outcome. Hence, in this example, the implication of the TARGET is 
reformed by the SOURCE. In the metaphor - /mit"i vato/, although the SOURCE -— sweet and the 
TARGET - talks, have a positive implication, the implication of the metaphor is negative. These types 
of metaphors are employed to instigate sarcasm in the conversation. Here, the literal meaning of the 
metaphor is “sweet talks’ which metaphorically describes a person who overplays the sweet talks. The 
meaning of this metaphor is flattery, blandishment or blarney, i.e., excessive and insincere praise carried 
out in a conversation to benefit oneself. This meaning is imparted when it is used in an appropriate 
context, for example, Sheena sweet-talked Rina because she wanted her notes. In the metaphor - /tik’i 
fortfa/, the SOURCE 1.e., the taste descriptive adjective — spicy, is intensifying the TARGET - 
discussion. When we begin to have spicy food, at first it tastes flavourful, gradually it becomes an 
addiction and eventually, if we don't cut off the spicy food, it will weaken the digestive system. 
Similarly, when the discussion begins, everyone wants to prove that what they are saying is the ultimate 
truth. This intensifies the discussion and after a point, none will be willing to capitulate. Ultimately, 
everyone who is indulged in the discussion starts judging each other and at the end, they are left with 
grudges for each other. Hence, the heat in the discussion weakens the relation. This weakening of 
relation is compared to the weakening of the digestive system caused by excessive spicy food. Hence, 
this 1s how the metaphor “spicy discussion’ is conceptualised as heated discussion/heated argument in 
the Gujarati language. 


2.2.4. RELATION IS TASTE 


The metaphor — RELATION IS TASTE appears when the social relations of an individual are described 
by a taste descriptive adjective. In the Gujarati language, two metaphors are widely used where the 
SOURCE is a taste descriptive adjective and the TARGET is a social relation. These metaphors are as 
follows: /mit*o sombond'/ ‘sweet relation’ and /k*ato sombond'/ or /3b3d" ma k*staf/ ‘sour relation” or 
“sourness in a relationship’. In these metaphors, the TARGET, i.e., relation, is described by two taste 
descriptive adjectives, namely, sweet and sour. When a relation is comforting and full of love, it is 
described by the taste descriptive adjective — sweet. Hence, the metaphor - /mit*o sombond"/ has a 
positive implication. While, when the relationship is toxic or when negative emotions like jealousy, 
distrust, falsity, etc. infiltrate the relation, the relation starts to waver. This type of relation or relationship 
is described by the taste descriptive adjective — sour. As sour is conceptualised as an unpleasant taste, 
the metaphor - /k"ato sombond'/ ‘sour relation’, holds a negative implication, i.c., wavering or 
weakening of the relation. 
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2.2.5. MUSIC IS TASTE 


When music and the voice of a singer or the sound of an instrument are described by a taste descriptive 
adjective, the metaphor — MUSIC IS TASTE, materializes. In Gujarati, only one taste descriptive 
adjective is commonly used to describe the music or the voice of a singer. This taste descriptive adjective 
is [mod"ur] or [mito] ‘sweet’. The SOURCE ic., the taste descriptive adjective holds its positive 
implication in the case of this metaphor. This can be exemplified by the following metaphors: /mod*ur 
songit/ ‘sweet music” and /mod*ur ovads/ or /mit*o ovadz/ “sweet voice’. In both the metaphors, the 
SOURCE complements the TARGET. The metaphors, /mod*ur songit/ and /mod*ur ovads/ or /mit*o 
ovads/ are used respectively when the music of the song is melodious and the voice of the singer or the 
sound of any musical instrument is pleasant or soothing. Hence, this metaphor has a positive implication 
and is typically used for praising. 


2.2.6. TRUTH IS TASTE 


When the truth is described as a taste descriptive adjective, the conceptual metaphor TRUTH IS TASTE 
emerges. This can be demonstrated with the metaphor - /kodwu sotjo/ “bitter truth’. When the truth is 
difficult to accept, admit or bear, it is generally described by the taste descriptive adjective — bitter. 
Bitter, as a taste is perceived as unpleasant. Hence, the SOURCE has a negative implication while the 
TARGET has a positive implication. People usually perceive bitter food as difficult to swallow. Hence, 
in the Gujarati language, the truth which is difficult to swallow, as in, difficult to bear or accept is 
conceptualised as “bitter truth’. In this metaphor, the SOURCE intensifies the TARGET as they are 
semantically incongruent to each other. 


3. Metaphorical Extension in Terms of Polysemy 


In section 2, while analysing the extended meanings of the taste terms derived from the conceptual 
metaphors, we also observed that few taste terms convey more than one meaning. This feature of the 
taste terms of conveying more than one meaning, when used metaphorically, induces Polysemy. The 
probably most widely accepted definition of polysemy is as the form of ambiguity where 2+ related 
senses are associated with the same word (Dabrowska & Divjak, 2019, p.23). In this section, we are 
investigating the polysemic behaviour of taste terms in the Gujarati language and the various meanings 
conveyed by a particular taste term. 


The typically used taste term in Gujarati - /mit*u/ and /mod"uc/ ‘sweet’ (/galju/ is not used 
metaphorically), conveys more than two meanings when used metaphorically. From the metaphors 
analysed in section 2, various meanings that the taste term /mif*u/ and /mod*ur/ ‘sweet’ convey, generally 
have a positive implication e.g., humble or kind for nature in the metaphor /mit"o svab"av/ “sweet 
nature’; polite for language and tongue as in /mif'i vani/ ‘sweet language’ and /mif' dgib"/ ‘sweet 
tongue’ respectively. Mclodious, soothing or pleasing for voice in the metaphor /mif"o ovad3/ ‘sweet 
voice’; honey-tongued or sweet spoken for the metaphor /mit"a bolu/ ‘sweet speaker’: loving or 
affectionate for glance in the metaphor /mit*i nodgor/ ‘sweet eyesight’; pleasant for laughter in the 
metaphor /mif"u hasjo/ ‘sweet laughter’; restful or balmy for sleep in the metaphor /mit*i i:g"/ ‘sweet 
sleep’; delightful for memory in the metaphor /mifi jad/ “sweet memory’ and so on. However, the taste 
term ‘sweet’ can also have a negative implication when used to initiate sarcasm in the conversation. In 
such a case, the sense associated with the taste term ‘sweet’ has a negative implication. e.g., a person 
who can insult or offend anyone with sugar-coated words can be described with the metaphor /mif*i 
chori/ “sweet knife’. Here, the function of the knife contributes to the metaphorical meaning, which is 
either to cut or to penetrate, employing the negative sense to the taste descriptive adjective — sweet. 
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Hence, the taste term /mithu] or /madhur/ “sweet” in the Gujarati language is semantically polysemous 
as it conveys more than two meanings when used metaphorically. 


The taste term /koduu/ ‘bitter’ usually have a negative sense associated with its meaning in a 
metaphorical context. Various metaphorically extended meanings conveyed by /koduu/ ‘bitter’ are as 
follows: hurtful or unkind for language, tongue and words in the metaphors /kaqdui vani/ “bitter language’ 
; /kodui dibh/ ‘bitter tongue’ and /kodva fobdo/ “bitter words’, respectively; acrid for criticism in the 
metaphor /kodui tika/ ‘bitter criticism’; unpleasant for experience as in /kodvo onubhov/ “bitter 
experience’ and the truth which is difficult to accept, admit or bear can be described as /koduu sotja/ 
“bitter truth’. Hence, the taste term /kaduu/ “bitter’ is semantically polysemous. Moreover, all the senses 
related to ‘bitter’ have negative implications because, the taste — /koduu/ ‘bitter’, itself is conceptualized 
as unpleasant. 


The senses related to the taste term /tik"u/ ‘spicy’ portrays intensity as the taste “spicy” is conceptualized 
as an intense taste. The metaphorically extended meanings conveyed by the taste term - spicy are as 
follows: fierce for nature in the metaphor /tik"o svabhav/ ‘spicy nature’; hurtful and unkind for words, 
in the metaphor /tik"a bol/ “spicy words’: sarcastic look as in /tikti nodgor/ ‘spicy eyesight’; petulant 
and fiery temper for the metaphors /tik"o mogods/ ‘spicy brain” and /tik"o midgad3/ ‘spicy mood’ 
respectively; stinging for the heat of the sun in the metaphor /tik"o tap/ “spicy heat of the sun’: heated 
discussion/ heated argument for the metaphor /tik"i ffort{a/ “spicy discussion’. Thus, the taste term /tik"u/ 
‘spicy’ 1s semantically polysemous as it conveys more than one meaning. But also, in general, all the 
extended meanings conveyed by /tik"u/ ‘spicy’ have negative implications. 


The taste terms /k’aru/ ‘salty’ and /mo|u/ ‘tasteless’ are semantically polysemous and they can be 
justified as follows. When the proportion of salt is more in the food, the taste of the food becomes 
unpleasant. Hence, the conceptualization of salt in its metaphorically extended meaning has a negative 
implication, as in harsh, spiteful and malicious in nature in the metaphor /k"aro svob*av/ ‘salty nature’. 
While if the proportion of salt is less or salt is absent in the food, then the food will be perceived as 
tasteless and is metaphorically conceptualized as lifeless. This negative implication of the taste term 
/mo|u/ ‘tasteless’ is reflected in its metaphorical extended meanings as follows: spiritless for a person 
in the metaphor /mo|o manps/ ‘tasteless person’ and spineless, timid or coward in the metaphor /mif"a 
vinano (manps)/ ‘saltless (person)’. 


Hence, in this section, we observed that in the Gujarati language, when the taste terms exhibit 
metaphorically extended meanings, their polysemous behaviour is highly dominated by the perception 
of taste terms and the taste itself. 


4. Conclusions 


The purpose of cognitive-semantic analysis of taste terms is to investigate various applications and 
functionality of taste terms in the Gujarati language. In this paper, we analysed 7 main taste terms of the 
Gujarati language. Initially, we attempted to study the prototypical usage of the taste terms, followed by 
the analysis of their extended meanings via conceptual metaphors. The metaphorical usage of taste terms 
highlighted the human perception towards various tastes and conceptualization of taste terms. Here, we 
observed that nearly all metaphors with the taste descriptive adjective — sweet, have a positive 
connotation attached to it because sweet as a taste, is conceptualized as pleasant. In contrast to that, 
spicy and sour tastes are conceptualized as strong while tastes such as bitter, salty and tasteless are 
conceptualized as unpleasant and hence, generally, these taste terms have negative implications when 
employed in a metaphorical context. Moreover, the taste term — sweet is used extensively in metaphors, 
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followed by the taste term spicy. This reflects the perception of Gujarati people towards the taste terms. 
As most of the Gujarati people are fond of sweet and spicy food, their conceptual metaphors derived 
from taste terms also implement sweet and spicy more than the other taste terms. Furthermore, Gujarati 
cuisine doesn’t include any manmade food of astringent taste and hence /turu/ “astringent” taste term’s 
application is limited to natural food items. Moreover, astringent does not have any metaphorical usage 
in the Gujarati language. Besides, the semantically polysemous behaviour of the taste terms provoked 
by the cognitive models of conceptualization and perception of the tastes and taste terms is also analysed 
in this paper. By analysing the taste terms, we observed how we conceptualize them with respect to 
other domains. Hence, this paper provides an insight into the impact of human perception on taste as a 
sense and its importance in conceptualizing the surrounding world. Since no previous work has been 
attempted on this topic, we endeavoured to explore the domain of taste and taste terms. Further attempts 
to perform a comprehensive analysis of taste terms in Gujarati will be made. 
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Keywords: Abstract: The Case module is a questionable construct in Government and 
Case, ergativity, optionality, Binding theory and more so in the minimalistic thesis. The present paper 
referentiality, minimalism broadly questions the presence of Case in language as “Why case?’ and how 


it effects the knowledge and representation of I-Language. Chomsky (2012) 
also talks about the “the questions of why Case exist at all”. He analyses that 
in minimalism, the Case Filter, as proposed by Vergnaud (1977), still exists. 
However, the case assignment is now case checking/ valuation which is just 
a by-product of operation Agree. Therefore, this paper attempts to delve into 
this ‘case- chaos’ (Sigurdsson, 2012) and proposes an answer for the 
existence of case in the theory of minimalism, drawing on the notion of 
‘referentiality’ (Hinzen, 2014). l also argue on the analysis of how and where 
is Case assigned now, with special reference to optional ergative case with 
empirical evidence from Sylheti, Nepali and Kurtép languages. I claim that 
ergative case in such languages is checked at the CP layer. The CP layer is 
the information structure layer and C is the functional head of it. The system 
uses the CP layer to ground the motivation behind the existence of ‘case’, 
which is referentiality of the DPs involved. This is to be noted that I use the 
term case here as Case, ‘a formal feature’, not case which is ‘a morphological 
category’ (Bobaljik and Wurmbrand, 2009) 


1. Introduction 


The present paper broadly questions the presence of Case! in language as ‘Why Case?’ and how it effects 
the knowledge and representation of I-Language. As Chomsky (2000) claims that in Minimalism draws the 
notion of the Case Filter from Government and Binding theory (Vergnaud 1977, Chomsky, 1981) into its 
system. However, the case assignment in Minimalism is now just a reflex of operation Agree with the 
advent of uninterpretable phi/ ¢p- features in the functional heads of C. T and v. He adds “that reverses much 
of the recent history of inquiry into these topics and also brings out more clearly the questions of why Case 


' By Case (with upper case C), I mean abstract or syntactic case which is assigned/ valued/ checked inside syntax. On the other 
hand, case (with lower case c) is used in instances of purely morphological case. 
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exist at all. The question arises more sharply if matching is just identity, so that Case can never be attracted; 
operations are not induced by Case-checking requirements”. Therefore, this paper tries to attempt to delve 
into this debate to probe the reason of existence of case in language and proposes an answer for the same 
departing from the minimalist enterprise. This papers also attempts to analyse how and where are case 
assigned now, with special focus on the optional ergative case marking with evidence from Sylheti, Nepali 
and Kurtép languages. This is to be noted that I use the term case here as Case, ‘a formal feature’, 
irrespective of its morphological marking. 

The section 2 talks about the leading theories related to case assignment in generative grammar. They are 
primarily Government and Binding Theory (hereafter, GB) and Minimalism which have dealt with the 
question of case assignment and checking. This section also briefly comments on how some scholars have 
dealt with the idea of morphological case in generative grammar. The section 3 elaborates on some of the 
justifications of existing literature on the function of Case in the system. The section 4 introduces Ergative 
case as the part of the system. In section 5, I lay down the rationales behind case assignment. Additionally, 
I show how ergative case (and accusative case) are assigned differently in some languages than the default 
cases, such as nominative and absolutive cases in some languages. The section 6 bears arguments from the 
empirical data from Sylheti, Nepali and Kurté6p to support the theory that is discussed in the section 5. 
Section 7 will conclude my study. 


2. Case in generative grammar 


This section presents the about various theories relating to case assignment in generative literature 
vis-a-vis GB (Chomsky, 1981), Minimalist Program (Chomsky, 1998, 2000 a.o.) and Dependent theory 
of case assignment (Marantz 1991, Bobaljik 2008 a.o.). The next subsection introduces how case is treated 
in GB framework. Chomsky and Lasnik, with Vergnaud (1977) describe the Case Filter/ Case Module as 
a module in GB, where it is proposed that: 


A. Every lexical DP must be assigned Case, and 
B. Ifit is unable to get case, either it should be moved to a case position or be a PRO. 
The Case Filter could explain the ungrammaticality of many English sentences like: 
(1) *I want he to go 
(2) *the book editor insistence on completing the work (Polinsky and Preminger, 2014) 
(3) *I think John to leave 


English has zero case morphology, nonetheless, it surfaces on pronominals. This is a hard evidence that 
morphological case cannot be correlated with the syntactic/ abstract case. The abstract case may or may 
not be overtly seen morphologically. For example, in sentence (1), “he’ is not getting the desired 
accusative case in the structure and it is ungrammatical. Again, the DP ‘book’ in sentence (2) is not 
properly marked by appropriate genitive cases, thus the ungrammaticality of the sentence. In the sentence 
(3), though the theta roles are correctly assigned with the verbs ‘think’ and ‘leave’, the sentence is still 
ungrammatical. The common problem which accounts for the ungrammaticality of the above sentences 
is case assignment. Therefore, Vergnaud (1977) rightly proposed that English, may not have overt case 
morphology like that of Latin, however, it has a case system as elaborate as Latin. The case filter is only 


2 The debate on ‘abstract case’ and ‘morphological case’ is a long standing one. This paper will not comment on that. 
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applicable to lexical DPs and empty category PRO is not subjected to Case Filter. For example, if we 
consider the following tough construction in English (4): 


(4) Johni is easy PROk to please 


Here the two theta roles of ‘please’ are assigned to the lexical item “John” and empty category ‘PRO’ 
from the lexicon and therefore, both the Projection Principle and theta criterion are checked. As stated 
above, PRO does not receive any case here. Instead of case filter, PRO obeys PRO theorem’. PRO is 
ungoverned and caseless*. For example: 


(5) Rashi wanted him to go 


If we compare sentence (1) and (5), the latter is the grammatical one as the pronominal ‘him’ is case 
marked, though it is not a theta-marked complement of the verb. from the above examples and 
explanations, we see that the finite Tense Phrase (TP) plays an important role in case assignment and acts 
as a case assigner in case of Nominative. A nonfinite phrase (IP) is unable to do so. Similarly, the 
accusative case is assigned by the lexical verb phrase (VP). Prepositional phrase (PP) can also be a case 
assigner. In featural decomposition of lexical categories, Chomsky (1981) described both prepositions and 
verbs as containing the feature [-N]. Categories with [-N] features are case assigners. In contrast, the 
featural configuration of TP has no clear picture and Chomsky (ibid) makes no analysis over this. So, to 
summarise, Cases (nominative and accusative) are assigned by government or by spec- head relationship 
in a finite clause. (6) shows this relationship in case of assignment of nominative case. 


NOM 


subj 


Questions against different modules of GB were started to be fueled in the early 90s, specially by Hale 
and Keyser’s introduction of L-syntax and “hidden transitivity’ (1992, 1998, 2002). With the advent of 
Minimalism (Chomsky, 1993, 1995), all the modules of GB were questioned. As a consequence, some 
were discarded as the part of the program and the Case Module was one of them. Minimalism was based 
on two important tenants. They were: 


A. Universal Grammar must corroborate with the “legibility conditions’ or requirements of the CI 
(Conceptual- Intensional) and AP (Articulatory-Perceptual) interfaces and also remove all the 


3 PRO theorem concludes that PRO is both +pronominal and +anaphor (Huang, 1982 a.o.), thus it should follow both condition A 
(An anaphor must be bound in its local domain) and condition B (A pronominal must be free in its local domain) of the binding 
theory. 

4 Although there are much criticism on this theorem, which is not a part of the discussion, here. 
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redundancy in the system (economy). 
B. The architectural principles of human language faculty must be common to all other organic 
system. 


And thus Chomsky (1995) declares that “Generative grammar can be regarded as a kind of confluence of 
long-forgotten concerns of the study of language and mind, and new understanding provided by the 
formal sciences.” On this line, the traditional process of case assignment is replaced, however, “the Case 
Filter still functions indirectly in the manner of Vergnaud’s original proposal, to determine the 
distribution of noun phrases’ (Chomsky, 2000). Chomsky (1995) remarks that the realisation of Spec- 
head relation as structural case is more ‘natural’ than theta marked cases of lexical items, 1.c. inherent 
case*. In early minimalism, Chomsky (1995) ties up structural case assignment to the specifier-head 
relation of NP and AGR, finding it the most ‘natural’ relation. Case licensing is also tied up with 
displacement and movement along with agreement to the spec of AGR heads. To elaborate, contra to a 
noun which is licensed as nominative in the specifier of TP in the GB conception, in the early minimalist 
construct, the case (for example, nominative and accusative) will depend “on the properties of T and V” 
(Chomsky, 1995: 159). He adds that T and V raises to respective Agrs and Case feature is provided T 
and V. The importance of spec-head relationship is schematically shown below in (7). 


(7) 


Spec, Agr,’ 


m™, 


Agro VP 


DP,t we 


V___ pp, 





In (7), according to Chomsky (1995: 159), the case feature is valued in DPs when the related T and V heads 
are raised or moved to the specs of Agr heads. Thus respective case (and agreement) features are valued in 
DPs when the DPs are in spec-head relation in the respective Agr complexes. Thus DP1 gets a nominative 


5 Inherent case is a type of lexical case, which is neither structural case nor purely morphological case. They are related to theta 
roles of the verb and are not assigned via operation 
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case because it is the AgrS complex in relation to T. Similarly DP2 gets an accusative case as a part of 
AgrO complex in relation to V head. The movement of T and V to the respective Agrs makes the spec-head 
relation an important local relation in the derivational space. Current trends however disassociate Case from 
the specifier position, and also from movement. The Agr element is converted the obligatory operation in 
the theory called Agree (Move)°. The DPs carry interpretable phi-features like person, number, gender etc. 
which is named the ‘Goal’ and the functional heads of C, T and v carry the uninterpretable phi features 
which 1s called the “Probe’. See Koopman (2006), where she defends the spec-head relation with empirical 
evidence from internal agreement system of Maasai DP, Long distance agreement in English, long distance 
agreement in Tsez and nominative objects in Icelandic. Comparing the two system is outside the scope of 
the paper. 

The functional head v is a part of split VP hypothesis (Chomsky. 1995, Hale and Keyser 1993, Kratzer 
1996, Marantz 1997), which is different from the lexical VP head V. v verbalizes the lexical root and Spec 
of vP gives theta role to the subject/ external argument, It also gives accusative case to the complement of 
V in the process of operation Agree. Mahajan (2012) talks about a double vP layering. The upper v is the 
voice head and the lower v gives accusative case. 

The DPs with uninterpretable phi features are active for operation once they are merged within the 
computational space from the lexical array and they undergo operation Agree with nearest functional head 
or probe and valuate them. Thus, Agree is a relationship which ‘matches’ features between the probe and 
the goal (Chomsky ,2000, 2001). Agree can be defined as (8): 


(8) 
a can agree with B iff: 
a. @ carries at least one unvalued and uninterpretable feature and B carrics a matching interpretable and 
valued feature. 
b. a c-commands fi. 
c. B is the closest goal to a. 
d. 8 bears an unvalued uninterpretable feature. 


Therefore, Agree fulfills the matching and valuation requirement between the interpretable phi-features on 
a DP (}) with the uninterpretable phi-features on a functional head a. The DP must have an unvalued Case 
feature to keep it active to Agree with the functional head. The DPs only get their Case as a stipulated by- 
product to the operation Agree to v or T. The DPs are frozen for any further computation for the fulfilment 
of Active Goal Hypothesis, after they get their respective Cases. Thus case assignment involves valuation 
of phi-features. So there are no more “case-positions’ in the syntactic tree anymore in minimalism and the 
role of the functional heads are immense 

For a sentence such as “John likes Mary’, Agree operates on the probes “T” and “v’ and goal DPs ‘John’ 
and ‘Mary’ respectively, in the derivation as demonstrated below (9). 


(9) 


6 Merge is the other obligatory operation. 


91 


Banerjee / 2021 


TF 
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As shown in (9), there are two instances of Agree relations. When the probe is v, it agrees with and case 
values the object as accusative. On the other hand, when T is a probe with uninterpretable phi-features, it 
agrees with ‘John’ as a goal with interpretable phi-features and assigns it a nominative case value. Both 
probes look down, find the closest goal in their c-command domains, against which they can value their 
uninterpretable features. The process “case assignment’ is a valuation of an uninterpretable feature in T via 
Agree, as it happens to other common phi-features of person, number and gender. As the nominative is 
assigned due to Agree with T, nominative and agreement has a close relation. 


Marantz (1991) eliminate Vergnaud’s formal feature of case and case filter altogether. He argues that 
“mapping between semantic roles and argument positions, augmented by the subject requirement of the 
Extended Principle” (Marantz 1991: 11) is sufficient to license NPs in argument positions. Based on 
differential subject marking in Georgian, he proposed the Dependent Case theory where he disassociates 
case from structural positions or movement and connects it to the relationship between the DP(s) involved. 
Case assigned in a step by step process via Disjunctive hierarchy. The Dependent Case Theory looks like: 


(10) Case Realization Disjunctive Hierarchy 


1. Lexically governed case 

ii. “Dependent” case (accusative and ergative) 
ill. Unmarked case (environment- sensitive) 
iv. Default case 


In Marantz’s system, case is assigned starting from step (1). If a language does not assign lexical case, it 
will assign (11) the dependent cases and so on. The assignment of dependent cases are parametric. The T+V 
complex can look up to assign ergative case to the DP, or it can look down to assign accusative case. 
Bobaljik (2008) adds on to Marantz’s statement and connects verbal agreement to it. He states that verbal 
agreement occurs in the reverse direction of case realization disjunctive hierarchy. By this he means 
agreement happens first with the DP with an unmarked case. Some languages, such as Nepali may be 
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parametrized to allow verbal agreement with a DP with ergative dependent case. Thereby, Marantz and 
Bobaljik make verbal agreement and case assignment is strictly a morphological process. Baker et al (2008) 
takes a middle path where he proposes that some languages include both dependent case argument and 
agreement-based case simultaneously in case licensing and push the agreement-based structural case into 
Marantz’s Disjunctive Hierarchy. This is shown below, with the agrecment-based structural case underlined 
in (11). 


(11) 
a. Lexically governed case is assigned first 
b. Then dependent case is assigned (either ergative or accusative case) 
c. Agree based case assignment is tweaked in here by the authors, where, according to 
them, structural cases are assigned. 
d. Then comes the unmarked case (nominative/ absolutive case) 
e. Default case is assigned last, to whatever NPs not still marked for case. 


(Baker and Atlamaz, 2014: 5) 


By (11), Baker and Atlamaz propose a dual case system 

Travis and Lamontagne (1992) introduces Case as a functional feature rather than a formal feature. K head 
of phrase KP is used to assign case to the DPs, which it immediately c-commands, and it itself is c- 
commanded by the Probe. K can be described as probe’s copy, as it shares the same features. Case is 
assigned derivationally when goal agrees with the probe through KP. Falk (1997) extends this proposition 
and proposes to take each case as features where English will be [+NOM, +ACC], thus parametric 
difference of case system can be seen in the case category head K of each language. 

This section gives a brief review of the common frameworks of case in literature. Some push case inside 
syntax and some push it to the morphophonological component. In the former case, the functional heads 
and the phi-features play a huge role. However, in the purely morphological theories, the DPs themselves 
become important where they ‘compete’ for case. Yet, neither of these theories clearly answers why the 
concept of case is needed at all. The next section will deliberate on this question. 


3. Why case? 


Pesetsky and Torrego (2011), at the end of their chapter “case” talks about “deeper questions about 
Case’ which can be summarised as “Why case exists at all?’ Case system and alignment systems are 
wide-spread. So, something which is so firmly present in world languages, must have a theoretical ground 
to exist. In GB, case was assigned within the Case Module inside the surface structure. With Minimalism, 
the Case is no more a module in the system. So why case assignment is necessary, which seems to be an 
‘extra’ operation after basic operations ‘Merge’ and ‘Agree’. This seems to violate the minimality 
condition, one of the basic tenets of the minimalist enterprise. According to Hornstein et al (2004) a.o., 
“Case Theory and Theta Theory are linked by the Visibility Condition’, drawing from Aoun (1979) and 
Chomsky (1995). The condition is: 


(12) A DP’s theta-role is visible or interpreted by LF (logical Form) only if it is case-marked or “it 
contains a case- position’. 
Chomsky (1995) approaches the problem by defining chain. 
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(13) {a1 .... an} is a chain as they are co-indexed and the highest element in chain governs or c- 
commands the lowest element. Thus, it forms a link. The chain and the individual elements to be 
visible and checked at LF to avoid a crash, the lowest element at the theta position must be case 
marked. LF cannot interpret DPs without case marking. 

Hornstein et al (ibid) use the following example to define the Visibility condition. 


(14) I met the man;[ that Mary believed t; to be a genius]. 
(15) *I met the man; [ that it was believed tj to be a genius]. 


The sentence (14) is grammatical because of the correct case assignment. The elements ‘man’ and t; form 
a ‘chain’ where ‘man’ is the head and t; is the foot of the chain. t;is the theta position of the verb “believed’ 
and is case marked by TP. Therefore, the DPs will be visible at LF as they are case marked. In the sentence 
(15), the verb ‘was believed’ is passive and t; is not a theta position, thus not case-marked and not visible 
at LF. 

Chomsky also goes on to add that “Case is a relation of XP to H, H is an X° head that assigns or checks the 
Case of XP. Where the feature appears in both XP and H, we call the relation “agreement”; where it appears 
only on XP, we call it “Case”. Talking about checking theory, Chomsky draws out two relations between 
CHL (computational system of human language) and LF where it is stated that the checked or unchecked 
features which are visible at LF are visible to CHL throughout and features invisible to LF are inaccessible 
to CHL once checked. Thereby, the uninterpretable features coming from lexicon inside the syntax cannot 
remain uninterpretable when they enter LF. This will lead to a crash and bad structure. Case is treated as 
one such uninterpretable feature which is attached with the respective DPs. Case features are different than 
phi-features (Chomsky 1995: 256). Phi-features are uninterpretable in their respective functional heads 
unlike case features which are interpretable in the respective functional heads. The reduction of case to case 
features has its own downsides, but the bigger problem is minimalist system does not have Visibility 
Condition anymore. If GB allows case to fulfill Visibility Condition and avoid crash, what is the motivation 
for the minimalist framework to allow valuation of case. Visibility Condition can be reworked into the 
notion of “interpretability at LF’ in minimalist system, connecting CHL to LF. This enterprise categorically 
states that the system neither allows superfluous symbols in the representations nor superfluous steps in 
derivations (Chomsky, 1995: 24). So, case features must be proved to be non-superflous for the system. 
Hinzen (2014) throws an interesting light on the reason and motivation behind the existence of case. Hinzen 
(ibid) distinguishes grammar, case and meaning and argues why combinatorial grammar needs a case. 
Proposals to connect case to theta role in visibility condition made case ‘meaningful’ but PRO and expletive 
subjects pose a problem here. Below are given a couple of illustrations that show how theta and case 
positions are not the same. 


(16) The child melted the ice-cream. 
(17) The ice-cream melts. English 


The DP ‘ice-cream’ receives theme/patient theta role in both (16) and (17). Despite this, the DP receives 
accusative case in (16) and nominative in (17). 

Furthermore, in connection to the relationship between theta and case, PRO does need a theta role but not 
necessarily case. Similarly, expletives do not need theta role but needs a case position. Lasnik (2008) has 
rejected the notion of case always needing theta roles and movement are not for case reasons, but the 
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question still remains how to incorporate that in the system. With evidence from Icelandic, Hinzen 
concludes that inherent case and morphological cases are different when both the sentences (18-19) mean 
the same with different case markings. Examples that Hinzen uses to show the importance of morphological 


cases are: 
(18) hin kastadi steinimum/ — *steininn Icelandic 

she threw stone.the.DAT/ stone.the.ACC 
(19) sie hat den stein/ *dem stein geworfen German 


she has the.ACC. stone/the.DAT stone thrown 
“She threw the stone.’ 


Therefore, case assignment is no more related to only theta positions. Through different existing literature 
it is seen that case assignment depends on many factors like lexical idiosyncrasies (lexical case), the 
characteristic of DP etc. This whole scenario of overlapping is correctly described as ‘case-chaos’ 
(Sigurdsson, 2012). 

He argues that Merge is a ‘maximally generic combinatorial operation’ and Agree is the “cover term for 
whatever grammatical relations we will find’. The lexicon is only the repository of lexical roots with 
features. Hereby, he questions where does the compositional “meaning, arise from in a sentence. As he 
explains, ‘man’ can be “man-meat’, “man-hood’, ‘mankind’ etc. and there are no prior stipulations which 
can be arguments or which cannot be. This is even true for determining parts of speech, which is also syntax 
dependent. Thus, he introduces the notion of ‘referentiality’ and tries to push it as a theory-internal element. 
He argues that grammatical relations, unlike features and lexical items, are interpretable, rationally. In case 
of use of Ist person by a speaker other than the hearer, is interpreted not through features but through 
relations with other elements in the sentence. The referentiality provided by determiners or pronominals 
are directly on nouns, but not grammatical relations or meaning. 

He says that the elimination of phi-features before shipping off the sentence to LF ‘rests on a conceptual 
mistake’ because elimination of phi-features ignores the notion of “grammatical relation’ in a sentence. He 
redefines phase as where ‘the distinctive grammatical meaning of case lies in the formal- ontological 
distinctions that systematically arise as and when words become grammatical arguments that as such enter 
event- and proposition- denoting phrases’. He establishes that theta relations are disjoined from arguments 
and case assignment. According to him, Agree is optional in many constructions (Sigurdsson, 1996) unlike 
case as case 1s obligatory, thus making case a true ‘derivational time-bomb’. 


4. A case of ergative case 


In the last section, we have seen that case can be of different types. In this context, case alignment 
systems of languages can also be of different types typologically. The major varieties’ are: 
a. Nominative-Accusative 
b. Ergative-Absolutive 
c. Tripartite 
In the nominative-accusative system, the subjects of all types of verbs, such as intransitives and transitives, 
are case assigned in a similar way, against the object of the transitive verb. For example: English. In 


7 There are other minor varieties too on which this paper will not comment on. 
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ergative-absolutive languages, the subject of the intransitive/ unaccusative verbs behaves and is case 
marked differently (absolutive) against the subject argument of the transitive verb/ unergative verbs 
(ergative). For example: Basque. In a tripartite system, the subject argument of the intransitive verb, 
transitive verb and object of the transitive verb behave differently from one another. For example: Nez 
Perce. Some Indo-Aryan languages, such as Hindi-Urdu show split-ergative system. In such type of 
alignment system, two alignments co-exist. The nominative-accusative system is seen in imperfective 
system and ergative-absolute system is seen in perfective system. For example: 


(20) ram anil-ko —_ uthaaegaa 
ram.NOM_ anil=ACC life/carry.Fut 
“Ram will carry Anil” (Mohanan 1990: 120) 


(21) ram=ne_ anil-ko —_uthaayaa 
ram=ERG anil=ACC life/carry.Perf 
“Ram had carried Anil” 


In (20) and (21), we can see the phenomenon of split ergativity where nominative-accusative system is seen 
in (20) and ergative-absolutive phenomenon is seen in (21) in a single language of Hindi-Urdu. 

One of the biggest issues with the tripartite case alignment system is the agreement system. Ergative 
DPs may or may not show verbal agreement. Variation in the domain of agreement is seen within the same 
genetic group of language. For example, within the Western Indo-Aryan languages (hereby WIA), we 
primarily see two types of variation in agreement system vis-a-vis ergative subjects. One of them is where 
the ergative marked subjects do not participate in the verbal agreement. For examples, see (20-21) of Hindi- 
Urdu above where the unmarked subject DP in an imperfective sentence in (20) participates in verbal 
agreement, in contrary to (21). In (21), the DP marked with ergative in a perfective construction does not 
participate in agreement. Rather, object agreement is seen. In Punjabi, another WIA language, optionality 
is seen between ergative and nominative case marking and subject-object alignment. For example: 


(22) 0 ae gali samjheyaa 
3.Sg.NOM this matter.F.Sg understand.Perf.M.Sg 
“He understood this matter” 
(23) o=ne ae gali samjhi 
3.Sg=ERG this matter.F.Sg understand.Perf.F.Sg 
“He understood this matter” Punjabi (Chandra et al (2016)) 


In (22-23), optionality is seen with the subject DP marking. It alternates with unmarked nominative and 
morphologically marked ergative in (22-23) respectively. It also correlates with subject and object 
agreement. The ergative DP does not participate in agreement. Simultaneously in Nepali, another WIA 
language, ergative subject agreement is seen. For example, see below: 


(24) bhaguta dekh-era sano kheta-le_ sam-Gu-na khoj-yo 
frog see-conj little boy=ERG catch-caus-inf try-Perf.3.Sg 
“Secing the frog, the little boy tried to catch it.’ 

(25) mero bahini/bahini-le kitab lekh-chin. 
my little.sister/little.sister((=ERG) book write-Pres.3.F.Sg 
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“My little sister writes books/ 1s writing a book / will write a book.’ 
Nepali (Lindemann, 2019) 


In both (24-25), the subjects are differentially marked, but both of them participate in subject agreement 
irrespective of the case marking(s). This data forms the crux of the problem of the paper. We see free 
variation in the subject marking with no implications on the verbal agreement system, unlike (20-23). 
Departing from the brief typological discussion of case and alignment above, to analyse the generative 
motivation for the existence of case, I choose to study the assignment of ergative case in split-ergative 
languages such as Nepali where verbal agreement system remains unaffected and shows no correlation with 
the subject marking optionality. Before talking about the special nature of these languages, I will briefly 
talk about some existing literature regarding ergative assignment. The assignment of ergative case has 
always been a locus of debate and a problem to case theory. Ergative case can be can vary according to 
subject- object, phi-features and transitivity. It shows an interesting variation (both micro and macro) 
among languages all over the world. It can be ‘configurational’ (Bittner and Hale, 1996; Marantz 1991) 
where the ergative case is assigned according to its structural position in comparison with other DPs in the 
construction. The ergative case becomes the dependent case (which becomes the accusative case in case of 
nominative-accusative constructions) in Marantz’s system as discussed above. Ergative case can also be 
inherent case, which is seen in different WIA languages where ergative case is assigned at the position 
where the external argument is introduced or at the position of agent theta role in v. This is also seen in 
Basque (Laka, 1993). 

Béjar (2000), to explain the complex agreement morphology in Georgian, introduces the notions of split 
phi-probe (which says uninterpretable person [a] and number [#] features enter into Agree separately, each 
determining its own Probe) and syntactic underspecification (that underspecified features in a language of 
third person and singular number, varying parametrically, are absent in the syntactic representations and do 
not take part actively in derivations) (Laka 1993). She places the [z] feature on the v head, to agree with 
the internal argument (v’s closest matching feature in 1ts c-command domain) and [#] feature on the T head, 
to agree with the external argument (T's closest matching feature in its c-command domain). 

To explain ergative displacement in Basque, Rezac (2003) wants to keep both the [a] and [#] probes on 
both T and v. The [z] probe on v increases its search space for Agree and agrees with the ergative in [Spec, 
vP] in case third person absolutives. The ergative goal 1s deactivated and the [z] on T is then blocked from 
agreement because of Active Goal Hypothesis. In Basque only third person is underspecified, not singular 
number. Sigurdsson (2012) notes that vP in ergative systems not only event-licenses ergative subjects but 
also triggers their ergative marking, much as the Appl head event-licenses indirect objects and also triggers 
their (commonly dative) case marking.” Chandra and Kaur (2016) propose the ergatives are stranded at v 
(though the treatment in unergatives is different than in transitives, as seen in Punjabi). In the next section 
I will argue that ergative case is not assigned during the operation “Agree’ under cither T or v, but 
somewhere else, i.e. by C. 


5. A proposal 


Hornstein et al (2004) declares that “1) the interactions of independent mental systems; or (2) “general 
properties of organic systems”” are two ways to approach any phenomena (complex or otherwise) in 
language. These are called methodological minimalism and substantive minimalism respectively. I will also 
attempt to tackle the question of “why case’ through the perspective of methodological minimalism. To 
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follow “Occam’s Razor’ and infuse theoretical simplicity in the system, defining case indeed becomes a 
problem. Either, other than Merge and Agree, the system has to allow another module for assigning case 
(in case of GB) or has to follow the case as by-product of Agree theorem (in minimalism).* 

To discuss Minimalism, the most important condition of the theory is to take into account the requirements 
of the performance systems of the CI (Conceptual-Intensional) and AP (Articulatory-Perceptual) interfaces. 
This condition motivated the leap from GB to Minimalism. Anything the system is going to do is by keeping 
into consideration the “legibility conditions’ of the interfaces. Therefore, to give the required output to the 
interface, the functional layers of the system play an important role. Other than the functional heads v 
(which assigns theta roles) and T (which assigns Tense), the other important functional phrase is CP 
(context phrase or complementizer phrase) which is the topic or discourse layer in CHL. This layer plays a 
vital role in connecting the pragmatic or discursive elements into syntax, thus making a step towards 
incorporating the world knowledge and referentiality. Portner et al (to appear) and Kumari (to appear) has 
shown that pragmatic notion of politeness has fed into allocative agreement of honorificity where CP 
captures this relation. In the line of this statement, it can be said that assignment of case can be an evidence 
of such feeding. Again Rizzi (2004) shows a cartographic analysis of split -CP structure like: 


(26) Force > Top* > Int > Top*> Foc > Mod* > Top* > QPembed > Fin > Subj 


As we in (26), the Fin of tense is closely related to the cartographic hierarchy inside CP. 

In connection to such dynamic role of CP, it is interesting to note that Hinzen pushes ‘referentiality’ to the 
“internalist’ enquiry and as a grammatical concept. I, taking the ‘referentiality’ concept from Hinzen, 
propose that CP is involved in ergative case assignment in some languages. I take nominative- absolutive 
as default case (Falk, 1997), and to denote the notion of referentiality, the system uses the ergative and 
accusative case as the non-default counterparts. During ergative case assignment in languages like Nepali, 
the case of the higher DP involved is assigned by C head of an active discourse context/ information 
structure. In cases such as Nepali, following Witkos (2003), I propose there is split probe in constructions 
with active discourse context/ information structure. In such cases, T gets the phi features via feature 
inheritance but C imparts the ergative case as seen in (24) with the -/e marker. In absence of an active 
discourse context and subject agreement, the probe is not split between T and C. T has all the features and 
it participates Agree as well gives its default case of nominative. 

To summarise, the interpretable features of the subject DP at the spec of vP agrees with the uninterpretable 
features of T and can get nominative case. However, to get ergative case, the DP must get case from C of 
CP. Therefore, case is not fully an immediate by-product of the Agree operation in instances where the 
ergative is assigned. The ergative is assigned higher up in the tree. Such type of assignments which are 
“higher in the tree’ are also discussed in Kidwai (2000) in relation to accusative assignment. For example: 


(27) john=ne cuha uske bag-m€ rak'a 
john=ERG — cuha his; bag-Prep put-PFV 
“john kept the rat in his bag” 

(28) john=ne  cuha=ko  uske bag-m€ rak'a 


8 Tt is to be noted that Agree is itself claimed to be a falsifiable operation as many recent evidence show. For example, in Zulu 
(Hilbert, 2015) case and Agree are separate operations, not that one is by-product of another. The case system of Zulu is a rare 
but such patterns are found in some other languages too. So if case is assigned to DPs just to fulfil visibility conditions at LF, it is 
to be pondered upon that what is happening in the languages like Zulu. 


98 
Banerjee / 2021 


john=ERG cuha=ACC his; bag-Prep put-PFV 
“John kept the rat in his bag.” 


In such sentences, according to Kidwai (ibid), “cuhe-ko ‘in (27) must be in a different position than ‘“cuhe’ 
in (28). In (27), “cuhe-ko’ has moved from the position of complement of VP to the outer spec of vP to 
undergo further computation. The higher space for DP to escape into can be the vP internal spec(s) of CP 
or internal CP (Belleti 2004 a.o., Jayseelan 2001). In terms of recent minimalist structure, the C of the 
internal C participates in operation Agree with the DP to impart accusative case. I will not discuss the 
detail version of how accusative case is assigned here because it is outside the scope of the paper. 
However, I want to show that accusative, similarly like ergative is sensitive to discourse information in 
some languages. 

The next section gives a detailed empirical support to my proposal, taking evidence from various 
languages which show optional ergative markings (hereby, OEM). 


6. Empirical evidence: Sylheti, Nepali, Kurtép 


In this section, I use empirical examples from Sylheti, Nepali and Kurtép to establish the hypothesis 
mentioned above. Sylheti is an Eastern Indo-Aryan language and a related language of Bengali. In two of 
its varieties at Assam, India and Moulvibazar area in the Sylhet district of Bangladesh, an -e marker is 
used ‘optionally’ as an agentive marker in 3 person’. But this marker behaves differently than the other 
established ergative markers in other split ergative languages in Western Indo-Aryan languages like Hindi- 
Urdu, Punjabi (as seen above) etc. because the former marker is present in all tense and aspect of Sylheti. 
Thus, this analogy tends to make Sylheti a split-intransitive ergative language, following the definition of 
Dixon (1994). However, Sylheti shows OEM in relation to ergative and nominative markings. Let us look 
at the following examples: 


(29) Faruk=e koi-s-e 
faruk=ERG say-PFV.3 
“Faruk has said (it)” 
(30) a. ke koi-s-e 
who say-PFV-3 
“Who has said it?’ 
b. faruk kois-e 
faruk.NOM say-PFV.3 
“Faruk has said it” 
(31) faruk ai-s-e/ *faruk-e aise 
faruk say-PFV-3 
“Faruk has come” 
(32) faruk =e samos-e bat xa-e 
faruk=ERG spoon=INS rice eat-3 
“Faruk eats rice with spoon” Sylheti 


9 The same marker is also used as an instrumental marker. 
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If we note sentences (29) and (30.b), the former has ergative marker, which the latter loses, though both 
the sentences are otherwise exactly similar. Both of them participate in subject agreement. The subject of 
(30.b) did not need Hinzen’s ‘referentiality’ as it has no discourse context to be encoded. Its discourse 
context is in the question in (30.a), whereas, the subject “Faruk’ in (29) needs ‘referentiality’ to ground 
itself in the discourse. Thus, C plays an important role here with an active discourse context. After agreeing 
with T, the DP gets its ergative case from C, unlike in (30.b). In (30.b.), the DP participates in Agree and 
gets (nominative) case with and from the same head T. The new proposed mechanism may look like figure 
below. 


(33) 


cP 


Cc’ 


Cc TP 
/™, 
I 
T vP 
or, 


/\ 
Ergative / 
were 


“See 


The tree (33) presents a schematic representation of how Sylheti assigns OEM. This also shows how the 
narrow syntax grounds the notion of ‘referentiality’ as the motivation for case. 
Similarly, Nepali, as also seen in (24-25) shows split ergative case system with its marker -/e in the 
perfective contexts. It surfaces during special pragmatic contexts, some of them are in the line of 
accomplishment (versus non-accomplishment) and obligation (versus desire) (Poudel, 2008) contexts. For 
example: 
(34) raam=lIe bihaan=dekhi paani ubh aa- i raha-eko cha 
Ram.M=ERG morning=ABL water fetch-COMPL remain-NMLZ Aux.NPST.M 
“Ram has been fetching water since morning (and he fetched some).’ 
(35) raam_ — bihaan=dekhi paani ubh aa-i raha-eko cha 
Ram.M.NOM morning=ABL water fetch-COMPL remain-NMLZ Aux.NPST.M 
“Ram has been fetching water since morning (but he has not been able to fetch).’ 
(36) mai=le yo kaam gar-nu par-yo 
1Sg.OBL=ERG this work do-INF fall-Pst.3.Sg 
‘T have to do this work.’ 
(37) ma__yo kaam gar-nu par-yo 
1Sg this work do-INF fall-Pst.3.Sg 
“T will do this work. 


These sentences (34-37) echo Kratzer (2002) that syntactic computation give different cases to the 
arguments of the same verb as seen in sentences (34) and (35), with verb “fetch” and sentences (36) and 
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(37) with verb ‘do’. The pragmatic context and the effect of completion of the task seem to make the 
differences in the above sentences, where the semantic meaning completion is encoded in the CP layer by 
ergative marker -/e. 

Hyslop (2010) also notices the influence of pragmatic factor in Kurtép, an East Bodish language of 
Bhutan, in the encoding of ergative -i marking. Kurtép shows ergativity in all three persons in past tense. 
Hyslop (ibid) calls the ergative nature of Kurtép as ‘pragmatic ergative’. She claims it to be “associated 
with topicality or emphasis’ or establishing a contrast. For example: 


(38) tshe ge-shang_ khit 
DM go-PFTCV.EGO 3.ABS 
‘So he left’ 

(39) khi —_ge-shang 
3=ERG go-PFTCV.EGO 
“She went’ 


(38) has an inactive discourse context where C does not assign case. However, in (39), C assigns ergative 
case to the DP to encode the discourse context. Kurtép shows default agreement in case in presence of most 
ergative. In Kurt6dp, unlike Sylheti and Nepali, C has both the phi-features and ergative case. Schematically, 
the case assignment in Kurtdp looks like: 


(41) 





Mcegregor (2009) named such phenomenon as ‘optional ergativity’. Thus, it is an empirically common 
phenomenon, very common in Nilotic group of languages in Africa. Such evidence can prove our theory 
of relating ergative(-accusative) case to the CP layer. Dubois (2017) also talks about “universal discourse- 
pragmatic pressure’ and similar alignment between discourse and ergativity which throws light towards 
patterns in ergative constructions. Balvin (2017) also talks about this similar “pragmatic function’ while 
choosing a particular type of construction in a language. 

This phenomenon of optional ergativity in different split-ergative languages ground my claim that encoding 
of referentiality can the motivation for case in the system. I further claim that (optional) ergativity is checked 
by functional head C of information structure CP. 
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7. Conclusion 


The Case is a questionable construct in Government and Binding theory and more so in Minimalist 
thesis, causing “case-chaos’. The theoretical motivation of case is hard to define and some scholars have 
attempted to discard the notion of case. However, the main point of the paper is to show that Case serves 
a definite purpose in the theoretical framework. It must be assigned properly from the lens of 
methodological minimalism to encode “referentiality”. The discourse laver (CP) plays an important role 
in ergative case assignment as a functional layer. It grounds the notion of referentiality in the narrow 
syntax by valuing ergative case. From the empirical evidence of optional ergativity in some languages, it 
is seen that Case (ergative and accusative) helps the narrow syntax to infuse the referentiality in the system. 
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Events ordering Comprehension and production of a ‘story’ in a ‘narrative’ is relative to our 
fundamental cognitive capacity of interpreting different temporal relations 
holding between the reported events in it. Our conceptual system makes 
sense of ordering of events from the grammatical and lexical information 
stored in the linguistic expressions. Most intriguingly, it is often found that 
the Cognizer hardly finds it problematic to reconstruct the order of the 
reported events in the story even when their ‘actual’ order is distorted in the 
narrative itself. This paper seeks to investigate how crucial the roles of the 
linguistic expressions are in bridging the gap between the narrated time of 
the ‘story’ and the way temporal relations are manifested in the ‘actual’ 
world. This paper will follow Reichenbach’s theory of Relative Tenses and 
attempt to demonstrate how the three temporal indices move along the 
timeline while comprehending a written discourse. Additionally, the role of 
complex predicate in Bengali is explored while studying event ordering in 
Bengali narrative discourse. A newspaper article has been analysed for the 
purpose. This entire process of event ordering takes place in our mental 
faculty. This paper, along with a syntacto-semantic approach, also studies 
how this process of event ordering is conceptualized in our cognition and 
tries to integrate the ‘\fental Space’ within the timeline model. 


1. Introduction 


While reading a text, a fundamental cognitive capacity lies in our ability to recognize the 
sequence of events that occur in the textual world- the ‘story’ within the ‘narrative’. (Genette, 
1979). Our conceptual system helps us to comprehend the order in which the events actually occur 
from the grammatical, aspectual and lexical information stored in the linguistic units. Trying to 
capture the grammatical nuances which help to order and connect the events together is no easy 
feat. But for the Cognizer, the reconstruction of the original order of the narrative happens with 
relative ease. Our cognitive capacity enables as to understand the text just as the author intended 
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it to be — not in the order in which the events are narrated, but in the order in which they occur in 
the ‘reality’ of the story world. 

This world is accessible to us -the readers- only through the language chosen and used by 
the authors/ narrators. Most narratives involve multiple events, located at multiple spaces at 
multiple points in time or spans of time, and involve multiple characters. And the events are 
presented in a temporally distorted order in most narratives. For example, a character in a narrative 
while narrating an incident, travels to a historical past time and recalls a connected set of events, 
through the stream of consciousness technique. Or a narrative narrating a series of events, starts 
with the most recent and presents the other events as leading upto it. Or a series of events are 
interspersed with flashbacks, so as to give more information or explanation to the reader. Or a 
narrative is interfered with viewpoints of different characters within the story. There are multiple 
techniques authors use to play with time in a discourse, such that the Reader/ Cognizer moves back 
and forth in time while comprehending and recreating the ‘story’ in their mind. An example of 
such temporal shifts is listed below. 

1. “And my life continued (E1)- I read books (E2) and did homework (E3), took tests (E4) 
and watched TV (E5), saw Daisy (E6), read (E7) and reread that college guide (E8) and 
imagined (E9) the array of futures it promised (E10). And then one night... I read her Star 
Wars stories (E11). Daisy’s most recent story “A Rey of Hot”, had been published (E12) 
the week before. I was astonished (E13) to see it had been read (E14) thousands of times. 
Daisy was kind of famous (E15). 


The story, narrated by Rey, takes place on Tatooine (E15) ...” (Green, 2018) (modified) 

On careful examination of the extract given above, it can be seen that E2-E10 can be 
considered events occurring within a span of time which is El. These events are habitual events 
and do not occur in the ‘story’ in the order in which they are listed in the ‘narrative’. E10 is an 
event within E9. E11 can be interpreted as a point at the culmination of El (an event marking the 
end of a certain way/phase of life). E12 occurs prior to E11, E13 occurs with E11, while E14 
happens over a period of time from E12 to E11. This extract shows clearly how the temporal order 
of the “actual” events in the “story” vary from the order in which they are represented in the text. 

Example (1) only demonstrates how the temporal order of the narrative and the 
reconstructed ‘story’ varies. What needs to be examined is how language encodes such temporal 
information which enable a cognizer to actually do the reconstruction in the order in which the 
events occur and the cognitive processes involved in doing so. When a Reader reads through a 
text, they continuously create in their mind a ‘Space’ where this ‘Story’ 1s conceptualized. This 
‘space’ is the projection of different temporal relations and interacting intentions of the narrative. 
Textual temporal informations are distributed over the network of ‘Mental Spaces’ created while 
reading the discourse, thus creating a ordered, connected story in the mind of the Cognizer. 

In Bengali, the distorted temporality mentioned previously is expressed through the system 
of Tense (Present, Past, Future) and its interaction with Grammatical Aspectual properties (Perfect, 
Progressive, Habitual) and Lexical Aspectual properties (Activity, Accomplishment, 
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Achievement, State). When one such Tense-Aspect Complex inflected on a Verb interact with 
another such complex, they modulate each other and this is represented in our cognition. And as 
every sentence progresses in a discourse, the conceptual structure of that discourse also gets 
modified in the mental space of the Cognizer. 

1.1, Research questions 


In continuation of the discussion above, this paper seeks to investigate the crucial roles the 
linguistic expressions play in bridging the gap between the narrated time of the ‘story’ and the way 
temporal relations are manifested in the ‘actual’ world. In other words, the ordering of the events 
in a narrative is not the mere reflections of its ‘actual’ counterpart: Actual temporality refracts 
through the language used in reconstituting it in a narrated story. The temporal construal of this 
space, corresponding to a particular ‘story’ in a narrative, is presented with the use of different 
linguistic expressions. Complex predicate is one among those large sets of expressions. The focus 
of this paper will be to study the temporal ordering of the events of the Story in the ‘Mental Space’, 
with a focus on the meaning construing capacities of the complex predicates. Since refraction is 
relative to the structure and function of the intervening medium, it becomes quite essential to 
explore how the shifts of ‘viewpoint’ (Evans & Green, 2006) linguistically contribute in unveiling 
skewed reality of the space. For the purpose of analysis, newspaper narratives of events such as 
accidents, criminal narratives will be considered. The choice is based on the fact that such 
narratives perform a variety of communicative functions like informing, involving the reader and 
convincing them and also involve multiple viewpoints and time periods/points on the timeline. 
(Krieken & Sanders, 2019). Bengali will remain the focus language in this paper. We will attempt 
to construct a syntax-semantics interface for organization of the discourse and to explain the 
relation between linguistic cues, their cognitive relations and how they modulate each other as the 
reader moves forward in narrative discourse. 

2. Theoretical Background 


For our analysis, we will try to adopt the timeline representation of events, with minor 
modifications. As we order events on the timeline, the events are also conceptualized within our 
minds. Hence, the events on the timeline also represent the mental spaces created by those events 
respectively and the connecting lines represent the movements thereby. 

At the discourse level, the terms ‘narrative’ and ‘story’ has been borrowed from Genette’s 
theory of ‘Narrative Discourse’. (Genette, 1979) According to this theory, ‘narrative’ refers to the 
entire narrative text or discourse- the signifier-which includes the events as well as the act of 
narrating. ‘Story’ is the narrative content or the reconstituted string of events- the signified. While 
reading a narrative, it is this story that our cognitive faculty reconstructs to make proper sense of 
the discourse. How such reconstruction of the story occurs from the information language carries 
and how the string of events is temporally organized in our psyche, is the object of analysis of this 
paper. 

Moving forward to the Tense and Aspect system, this research follows the traditional 
Reichenbachian (1947) framework of relative tenses (S-R-E approach), where any temporal 
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relation can be defined by means of these three arbitrary and abstract time points -Speech Time 
(S), Reference Time (R), and Event Time (E). While Reichenbach and other philosophers and 
linguists working on time and tense have been able to define speech time and event time, the 
definition of reference time has remained vague. However there does exist a third point or index 
in trying to explain the relative notion of tense. Unlike the initial Reichenbachian time of reference, 
however, it is not a fixed point, but varies with context within the discourse. In his Elements of 
Symbolic Logic (1947), Reichenbach have himself drawn attention to the inconclusive nature of 
his Reference Point. 

“Tt is not clear which time point is used as the point of reference. This determination is 
rather driven by the context of speech.” (Reichenbach, 1947, Elements of Symbolic Logic) 

For the current purpose, the relation between Speech Time (S) and Event Time (E) will 
define Tense while information about Grammatical Aspect will be defined by the relation between 
Event Time (E) and Reference Time (R). 

For Lexical Aspect, this paper will follow Vendler’s four categories- States, Activities, 

Accomplishments and Achievements (Vendler, 1957). Since lexical aspect is an inherent 
property of the verb, it is of primary importance in comprehension of verbs which are unmarked 
for tense and grammatical aspect. In case of Bengali complex predicates, the lexical aspect of the 
main verb is blocked and it is only the light verbs which carry information about the lexical aspects. 
Furthermore, they interact with temporal adverbs, and the information about the internal structures 
of simple verbs and complex predicates are vital in comprehension of event construal and construal 
of connected events in discourse. Section (3) on Data analysis will demonstrate further how the 
above- mentioned interactions take place. 
While temporally ordering a discourse, a syntax semantics interface becomes necessary. The 
lexical aspect, grammatical aspect and tense interact with each other as well as with other temporal 
adverbs. This justifies the need for explicitly including aspectual features within the traditional 
syntactic framework. The model used in this paper is proposed in Karamakar (2008) and developed 
gradually in Banerjee & Karmakar (2016), Banerjee (2018), Banerjee (2018) and reproduced here. 
The integrated syntactic structure is as follows and will be used for the rest of the syntacto-semantic 
representations in this paper. 
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2. AgrSP Relation between Tense and Person markers 
Spec AgrS 
AgrS f TO ee "Relation between STandRT-=——SS 7 
ar 
ff GrAspp Relation betweenETandRT SS 
Spec Asp 
VP-EXTERNAL 


VP-INTERNAL 


ee ee et ene eee eee 








The VP-Internal layer takes into account the entire argument structure (the subject which 
originates in the specifier position of the higher VP, the verb and the object which originates in the 
specifier of the lower VP along with its agreement features). It also contains the Lexical Aspect or 
the default Situation Type of the verb and the adjuncts which semantically modify the situation 
type at the sentence level. The VP-Internal Layer shows the interaction between the Lexical 
Aspect, adverbial phrase and the verb. 

The VP-External Layer gives information about the Grammatical Aspect (in terms of 
boundedness and continuum or Perfectivity and Imperfectivity), which refers to the relation 
between RT and ET (the time span spent during the event). It is c-commanded by (T)ense which 
refers to the location of the event in time. It is further dominated by AgrSP, which refers to the 
subject-agreement features. The T c-commands the entire Aspectual categories; the GrAsp c- 
commands LexAsp which further c-commands AdvP and finally the Verb. 

The term “Mental Space’ refers to the mental spaces as defined and developed by Gilles 
Fauconnier (1985, 1994, 1997) in his version of Mental Spaces Theory. While processing a 
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narrative, the cognizer/reader constructs a mental model of the story world, based on the events, 
characters, time and space. It is conceptualized as a network of mental spaces and the linguistic 
cues guides the cognizers/readers through this network, where addition of each sentence leads to 
creation of new spaces and addition of information to the existing spaces, thus modulating the 
existing conceptual structure of the story. The final structure is the result of changes brought about 
by every bit of information added to the starting point or ‘base’. 

According to the Co-compositionality Principle, the interaction between expressions in a 
context varies with the relationship between them. This relationship modulates the dynamics of 
the mental space in which language exists. Even the nodes of the syntactic trees interact in a similar 
manner- they are relational in nature and modulate the next higher level. The structure of Tense 
(Present, Past, Future) gets modified when Grammatical Aspectual properties (Perfect, 
Progressive, Habitual) and Lexical Aspectual properties (Activity, Accomplishment, 
Achievement, State) interact with it. When one such Tense-Aspect Complex inflected on a Verb 
interact with another such complex, they modulate each other and this is represented in our 
cognition. And this is how as every sentence progresses in a discourse, the conceptual structure of 
that discourse also gets modified. 

This paper will follow verb construction from Chatterjee (1939). According to his well- 
accepted theory, there are twelve tenses in Bangla. The simple tenses are Present, past and future. 
They combine with simple, progressive, perfect and habitual aspect to form complex tenses. 
Bengali tenses have clearly demarcated aspect markers. Simple tenses show an absence of aspect 
marker; ‘t/ in progressive tenses and ‘et{’in perfect tenses serve as the aspect markers 
respectively. Other than these, there are three more tenses- past habitual, progressive habitual and 
perfective habitual. 

3. Data Analysis 


A short newspaper narrative will be analyzed using the theoretical frameworks discussed in 
Section 2. 


3. Sl Paanch diner mathae onoshon protyahar’ korlen (El) Howrah _ jela 
shongshodhonagarer dui rajnoitik bondi Barun Sur ebong Ajoy Das. 
S2. Sombar theke tin dofa dabite onoshon shuru korechilen (E2) tara. 
S3 Ekti daabi jel kortripokhyo mene neowa-e (E3) shukrobar tara onoshon protyahar 
koren. (E4) 


S4 Dui bondir ghonishtho der obhijog chilo (E5a) je bela 12tae cell-e dhukive deowa 
hoto (ES5b) Barun ebong Ajoy ke. 

S5__ Edin kortripokhyo janan (E6a), onyoder motoi bikele cell-e lock up kora hobe (E6b) 
tader. 
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At the beginning of the narrative, since this is a third person narrative, there is a space 
created which is the ‘present of the reader’- a space embedded in reality — the present when 
the reader reads the text. As the act of reading starts, another space is created which include 
the speech time / narrative time of the text. The reader is transported to the ST / NT position 
at the beginning of the narrative. It is the Present time with respect to the narrative and is 
part of the Narrative World. 

When the reader reaches El, they enter the Story World. The verb structure at El is a 
complex predicate with internal structure N-V- protvahar korlen ‘withdraw do-SIMPL- 
PAST-3past’. The verbal complex means to completely withdraw the fast. The nominal 
component protvahar ‘withdraw’ provides both the argument structure and Lexical aspect 
(Achievement). The light verb kor-‘do’ 1s semantically bleached and only carries the 
inflections for grammatical aspect (simple, unmarked), tense (past) and person. The 
adverbial adjunct marks the time of the event as paanch diner mathae ‘at the end of 5 days’. 
The complex predicate is hence a punctual or point event in the past — marking the 
culmination of the durative event onoshon ‘fast’. According to the Reichenbachian S-R-E, 
the reference time is situated with the event time. 

From a cognitive viewpoint, when information from S1 is structured in the 
Reader’ s/Cognizer’s mind, a mental space El 1s created. The total temporal and aspectual 
information that is presented in the integrated syntactic structure (shown below) occurs 
within the space El. The adverbial adjunct 5 diner mathae ‘at the end of 5 days’ is focused 
upon in the tree diagrammatic representation, leading to a corresponding cognitive focus 
on the mental space El. The viewpoint cooccurs with the Speech Time/ Narrative Time 
situated outside the ‘story’ world — hence it is inter-sentential. 

The distance between E1 and ST/NT or the Present of the Narrative is not explicitly stated, 
hence it can either be that El overlaps with ST or is located prior to ST. Since this is a 
newspaper report, it can be imagined that the narration is taking place just as the event El 
ends- hence E1 overlaps with ST/NT. The other option that can be imagined is that the 
event El has occurred at a considerable time before the ST/NT of the reporter/ narrator. 


E2 once again has a complex predicate of structure N-V- shuru korechilen ‘start do -PERF- 
PAST-3past’. Shuru ‘start’, which is the nominal element in the verbal complex, provides 
both the argument structure and the Lexical Aspect (Achievement). The semantically 
bleached light verb kor- ‘do’ carries the inflection of past tense and perfective aspect. So, 
it is a punctual event, focusing at the start or begining of the onoshon ‘fast’ and is situated 
in the further past with effect going on till the point of its conclusion (E1), which is the 
position of the narrative voice at this point (reference point). The conceptual space must 
hence be created to the left of El (preceding E1) on the timeline, containing temporal and 
aspectual information of E2. The reference time must occur at a point between E2 and 
speech time, more specifically at E1. 
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The durative adverbial adjunct sombar theke ‘from Monday’, is the focus phrase in the 
tree diagrammatic representation and results in shift of focus to E2. It gives the event a 
sense of accomplishment making it durative, rather than punctual. From the temporal 
adverbs paanch diner mathae ‘within 5 days’ (S1) and shombar theke ‘from Monday’ (S2), 
it can be further derived that E2 occurs on a Monday and the temporal duration between 
E2 and E1 is five days. The viewpoint still remains with the Speech Time/Narrative Time. 
S3 consists of a serial verb construction with two complex predicate of structure V main V 
light —N V light. The first V main-V light construction is mene neowa ‘accept take— PERF’ 
(E3). The main verb mene ‘accept’ provides the argument structure to the verbal complex. 
Neowa ‘take’, is an Achievement verb, and hence is punctual in nature. The achievement 
aspect indicates a change in phase, and thus provides the sense/meaning of completion of 
event when added to the main verb. Together, the verbal complex takes up an inflection -e 
for indicating completion of the event. However, since it is a part of a serial verb 
construction, the tense and person markers are blocked. Without the tense marker it 
becomes difficult to properly place the conceptual space created for E3 on the time line. 
But, conceptually, E3 must occur at a point after E2 and before E1 (that is after the start and 
before the end of onoshon ‘fast’. The focus now shifts to E3 and view point shifts to E2, 
from where the event E3 is now viewed. 


When the reader reads the second part of S3, he takes into his cognition the final N-V light 
complex predicate of the serial verb construction (E4) - protyahar koren ‘withdraw do — 
SIMPL- PRES- 3pres’. The nominal component of the verbal complex protyahar 
‘withdraw’ provides both the argument structure as well as lexical aspect (Achievement). 
The light verb koren ‘do’ is completely semantically bleached- its ability to take argument 
structure and provide lexical aspect are both blocked in this construction. It provides the 
structural support to carry inflections for Grammatical Aspect (Simple, unmarked), Tense 
(Present) and Person (3" Person). 

Since the complex predicates E4 and E1 are the same, E4 must refer back to E1. In other 
words, E4 and E1 refers to the same event and hence must take place at the same time as 
El, irrespective of their Tense and Aspect situation. What is different about the two events 
is the shift in viewpoint and focus. The view point must shift to E3 from where F4 is viewed 
— it is an intra-sentential viewpoint at this junction. The adverbial adjunct shukrobar 
‘Friday’ results in shift of focus to E4. The present tense indicates that the event is 
occurring at the time of speech or narration. From Reichenbach, it is clear that speech and 
reference time overlaps. Thus, R4 and S4 occurs within the Mental Space E4. Conceptually 
it become clear to the reader that the Mental Space E4 overlaps with the Mental space 
where the ST/ NT is located. 

Cognitively the mental space created for E4 must overlap with El. The speech time and 
reference time must shift to E1/E4 on the time line. Also, E3 results in E4 since they are 
connected by the causal connective — ‘e’. Therefore, the position of E3 on the time line 
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must be at some point before E4/E1. The temporal adverbs 5 diner mathae and shukrobar 
must refer to the same point on the timeline. 

The reader moves to S4. The reference time and speech time already exists at E1/E4. S4 
consists of two sub events, demarcated by two separate verbal complex- a complex 
predicate of structure N-V- obhijog chilo ‘complaint be- SIMPL-PAST-3Past’ (E5a) and a 
complex predicate structure with three verbal elements- dhukive deowa hoto cause to enter 
give to be —- HAB — PAST?’ (E5b). 

In ESa, obhijog ‘complaint’ is a verbal noun, hence accounts for both argument structure 
and lexical aspect (State). Chilo ‘be- SIMPL-PAST-3Past’ takes up inflection for 
unmarked or simple grammatical aspect and Past tense. The internal structure of the event 
is [- bounded, + continuum]. The past tense inflection provides information that event time 
corresponds to the reference time (E5a= RS5a), both of which occur prior to the speech time 
(S5a). A stative event generally provides backgrounded information/ narrative elaboration/ 
explanation/ description, while the narrative is paused. Thus, E5a is a backgrounded 
information- an explanation-for the main event. 

Cognitively, the mind of the reader interprets ESa as a connected event which had occurred 
in the past and is affecting the current string of events. It does not form a part of the 
continuous chain of events on the timeline and does not introduce a new event. However, 
the causal connection leads the reader to interpret E5a as a connected event, albeit creating 
a temporal gap with El, E2, E3 and E4. 

E5a must have occurred before E2 — obhijog ‘complaint’ entails there must have been some 
problem in the past which leads to complaint and not solving the complaint must have 
resulted in the onotion ‘fast’ (E2). During the event ESa, the viewpoint must also exist at 
the mental space E5a along with reference time. 

The adverbial adjunct bela barotae ‘at 12 in the afternoon’ which moves to the focus 
phrases, leads to creation of another mental space E5b where the event marked ESb takes 
place along with its related temporal and aspectual information. ESb is a complex predicate 
where the main verb dhukiye deowa ‘cause to enter give’ (Nominal Verb Phrase) carries 
the argument structure and the lexical meaning. When the internal structure of dhukiye 
deowa ‘cause to enter give’ 1s considered, it has a V-V structure. While dhukive ‘cause to 
enter’ provides the argument structure, deowa ‘give’ provides the lexical aspect (Activity). 
The verbal complex is nominalized and when attaches to the light verb hoto-HAB-PAST. 
As with any other N-V construction, the light verb is completely bleached and the it works 
to only carry inflection for past tense and habitual aspect. That is, an event is being repeated 
over a period of time in the past. The speech time remains at E4/E1, reference time remains 
at E5a and the event time E5b moves to a point prior to ESa. In other words, the mental 
space E5b is located prior to E5a on the time line. The viewpoint remains at E5a. 

ESb is also not a part of the main event, but a separate, connected event leading up to it. 
Even though E5a and ESb can be ordered, causally connected and plotted on the timeline, 
the ‘real time’ temporal distance can’t be derived. In other words, ES5a-E5b can be located 
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immediately prior to E2 (continuous with E2) or there can be a real-time gap between them. 
Also, E5b being a continuous event, the reader can only assume how long the event has 
been going on. 


The speech time is at E4/E1 when the reader reaches S5. S5 consists of a matrix clause 
(E6a) and embedded clause E6b. The adverb edin ‘today’ moves to the Focus creating a 
mental space E6a. Cognitively the viewpoint shifts to the present, that is to a point after 
E1/E4. E6a janan ‘to let someone know’ is a single verb in simple present tense with an 
Achievement type lexical aspect. In other words, it is a point event taking place in the 
present- the present of the narrative. The last bounded event was E4, so logically E6a must 
follow E4/E1 on the timeline. However, since E1/E4 overlaps with ST/NT or the present 
of the narrative and so does E6a, the mental spaces E1/E4 and E6a overlaps and belongs to 
the cognitive space which carries the ST/NT, following E3. Conceptually, it is an 
elaboration for E3. Following Reichenbachian S-R-E, speech time, event time and 
reference time coincide at this poi 


In E6b, the complex predicate is bilingual- /ockup kora hobe ‘lockup do to be — SIMPLE - 
FUT-3rur’ refers to a promise of a future event. Lockup kora ‘lockup do’ acts as the main 
verb-internally, it has an N-V structure. Lockup takes the argument structure and provides 
the lexical aspect (Activity). Together, the structure is nominalized and combines with the 
final light verb hobe ‘to be-SIMPL-FUT. The light verb takes up inflections for simple 
future tense. Since the complex predicate refers to a future event, it must occur after E6a 
(speech time is now at E6a). 

It also implicates that since E6b is a habitual event (even though it is not structurally 
implicated as habitual) it will be repeated for a particular period of time in the future. The 
speech and reference time co-occurs at E6a. While the focus shifts to E6b the view point 
remains at E6a. 


4. Results 


The figure given below (4) is a timeline representation of the conceptualization of the 


discourse analyzed above through construction of mental spaces. Vx represents the position of the 
viewpoints at the respective event times Ex. On careful study, it can be clearly seen that the event 
order is radically different from the order in which events are produced in the text and the time 
moves back and forth to construct the ‘real’ order. The arrows represent the temporal shift of events 
on the timeline. Every conceptual space or node consists of integrated syntax-semantics interface 


information of the respective events presented in the form of syntactic trees. An example is given 
in Figure (5). The rest of the events are represented in a similar manner within their respective 
conceptual spaces. Moreover, E2, E3 and E4/E1 form the main event onoshon ‘fast’. Also, due to 
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the dynamic nature of temporal ordering, E4/E1 are corrected later on and conceptualized as 
overlapping with the ST/NT or the Present of the Narrative 


REAL WORLD 
4 . NARRATIVE WORLD 


Reader is transported here 


STORY WORLD 


PRESENT OF 
READER 





T GrAsp' 


~ 
we. vexAspP 
GrAsp aa 
Simple a 
Spec 
LexAsp 
Activity Vv 
Subj oe 
Vy" 
AdvP 
5 diner mathae 
Vv 


protyahar korlen 


Spec 








Integrated Tree diagrammatic representation of event El, contained in the conceptual space El 


5. Discussion 


A narrative discourse is made up of multiple events connected together. In order to make sense 
of the entire discourse, the individual events need to be analyzed first. Temporal information of 
the individual events are encoded in the Tense and Aspect system, which are presented as 
inflections on verbs in Bengali. The temporal adverb, which 1s generally ‘focused’ upon and 
modulates the existing temporal structure, also helps create a new ‘mental space’. 

5.1. The Complex Predicate Problem 
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Aside from the simple verb structures where the single main verb contains both the 
argument structure and lexical aspect as well as bear grammatical inflections, there are, in Bengali, 
an abundance of complex predicates. Trying to define and classify complex predicate is beyond 
the scope of this paper. However, the knowledge of the syntactic components of the complex 
verbal constructions is essential because this status determines their semantic contributions to the 
event structure and aspectual scope. Based on this, there are two types of constructions- a V main 
-V light structure and an X-V light structure (where X can be either a Noun/Adjective/Preposition). 

When a complex predicate consists of a main and a light verb (V-V structure), the main 
verb provides only the argument structure while its lexical aspect is blocked. The light verb, on 
the other hand, provides only the lexical aspect while its argument structure is blocked. Thus, the 
two individual elements provide the two different aspect of meaning — while the main verb provide 
the argument structure, the light provides the lexical aspect. Together they form a complex entity 
and take up information on tense, grammatical aspect and person in the form of inflection on the 
light verb. 

Thus, the main verb carries the lexical meaning while the light verb is semantically 
bleached or modified. In the integrated Syntactic structure mentioned in Section 2, the light verb 
would belong to the VP-Internal layer, move to the LexAsp node, and take up information on 
Lexical Aspect. Together the verbal complex would move out of VP-Internal layer and take up 
inflections accordingly. 

When a complex predicate consists of an X and a V light component (X-V structure), the 
X component acts as a verb and provides both the argument structure as well as the lexical 
aspectual component. The light verbs are semantically completely bleached and only act as 
structures to carry inflections for tense, grammatical aspect and person. For the X component and 
V light component to occur together as a single complex, they need to possess a similar argument 
structure -it is not just the number of arguments they take that needs to be compatible but the theta 
role needs to be similar as well for them to function together as a unit. 

For instance, 


6. Bujhlam, du raat na ghumiye o shokal — e ektu ghumo-e-cch-e (‘ghumo-PRES-PROG- 3pres’ ). 
7. Bujhlam, du raat na ghumiye o shokal — e ektu ghumiye ni-a-chch-e. (‘ghum-iye ni-PRES- 
PROG-3 pres’ ) 


(6) is an instance of a simple verb, while (7) shows the use of a complex predicate in the same 
context. In example (7), the main verb is the same as that in example (6), except that its lexical 
aspect is blocked. The lexical aspect for ghum ‘sleep’ is a State in example (6). The light verb ni- 
‘take’ in example (7) does not take any argument but gives an Activity sense lexical aspect to the 
entire verbal complex in (7)- that is, a sense of [-bounded] and [+continuum] to the complex 
predicate. 

Other than these two structures, there are other complex predicate structures as well. These 
includes complex predicates involving three components (representing single event) and bilingual 
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complex predicates. However, a generalized discussion is beyond the scope of this paper even 
though we will come across instances of both in the next section. 
5.2. Creation of Mental Spaces and Ordering of events in the spaces via shifts in viewpoint: 

Cognitive linguistic framework, establish connections between the linguistic features and 
the cognitive representations of narrative discourse with the aim to understand how meaning 
emerges in the mind during reading. This work adopts a mental space approach- a space is created 
as the reader progresses in discourse. Each space hosts an event- and the complexities related to 
it. There is a relation between the linguistic expressions and the cognitive representations at the 
level of the individual events and this relation is embedded in the mental space. The spaces are 
connected with each other at discourse level through their relative position on the timeline, 
connected via arrows. 

The tense -aspect system provides information about the temporal location of an event. But 
within a discourse, this temporal location is dynamic in nature in that every event is temporally 
located with respect to the other events within the discourse. Each mental space also hosts a 
number of latent viewpoint that can be selected as the temporary vantage point of narration. During 
the construal of the narrated events, only one viewpoint can be selected over the others. The 
temporal location is also dependent on the viewpoint. The point of view from which an event is 
viewed varies and shifts with progression in reading/comprehension. The viewpoint can be intra- 
sentential and inter-sentential, that is, looking at a sentence from within the event or looking at it 
from outside the event. 

6. Conclusion 


Ordering of events is a dynamic phenomenon when it comes to our cognition. Quite like the 
gradual development of an image by an artist, a discourse gradually and dynamically builds up in 
our mental space. It is indeed true that more often than not the linear or canonical order of time 
varies from the order presented in the narrative. The structure of canonical time gradually unfolds 
itself through the narration. 

This paper has tried to look into three major aspects of temporality by analysing in details a 
newspaper narrative. First is the notion of complex predicate. Even though substantial work has 
been done in the area, a lot of avenues remains unexplored and unexplained. The primary objective 
of this paper was to explore one such viewpoint, especially in discourse, where time is itself a 
dynamic phenomenon. How the components of complex predicates — the X or main V component 
and the light V component individually provide information towards event ordering as well as how 
the components acts together as a unit, has been studied in details in the paper. 

Secondly, the purpose of this paper was to attempt to re-order events in a narrative discourse 
and recreate the ‘Story’ as it is in reality. The final timeline representation as shown in (10) 
demonstrates the position and movement of different temporal elements and re-ordering of events, 
thus recreating the ‘real’ ‘story’. The third aim was to look at the process of reordering and 
recreating the story from a cognitive viewpoint. The ‘Mental Spaces’, representing the events and 
carrying both syntactic and semantic temporal information has been studied in details and 
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organized on the timeline representation in (4). While doing that, looking into the viewpoint from 
which an event is analysed, becomes a necessity. The integrated syntactic framework developed 
previously has proved to be a vital tool in the analysis. 

However, syntax, semantics and their interface hardly prove to be sufficient in discourse 
analyses. As not every information is structurally always present, the role of pragmatics often 
comes to the rescue. Language deals with “what we say” but indeed leaves a lot of space for “what 
we do not say”, yet the cognition of the brain of the reader is able to rightly capture the unsaid 
nuances. For example, in event ES5a, the structural information locates the event as prior to E4/E1, 
but its exact location on the timeline can only be derived through implicatures an (Fauconnier, 
1994)d causal connections. Thus, what needs to be further examined is the communicative 
intention of the author. How meaning emerges in the minds of the reader is a complex process and 
no single structural theory can be considered enough to explain the complexity. 
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Psycholinguistics; The quest to understand how language and concepts are organized in the 
Semantic Priming; Mental human mind is a never-ending pursuit undertaken by researchers in 
Lexicon; Response Time; computational psycholinguistics; simultaneously, on the other hand, 
Degree of Priming; Word2vec researchers have tried to quantitatively model the semantic space from 
model. written corpora and discourses through different computational approaches 


— while both of these interacts with each other in terms of understanding 
human processing through computational linguistics and enhancing NLP 
methods from the insights, it has seldom been systematically studied if the 
two corroborates each other. In this paper, we have explored how and if the 
standard word embedding-based semantic representation models represent 
the human mental lexicon. Towards that, we have conducted a semantic 
priming experiment to capture the psycholinguistics aspects and compared 
the results with a distributional word-embedding model: Bangla word2Vec. 
Analysis of response time indicates that corpus-based semantic similarity 
measures do not reflect the true nature of mental representation and 
processing of words. 


1. Introduction 


Mental lexicon (ML) is the mental dictionary used to explain how words are interpreted in the human 
brain (Grainger et al. 1991, Drews & Zwitserlood 1995, Taft 2004). While words have differing degrees 
of affiliation and are regulated by different linguistic and cognitive restrictions, the exact meaning of 
the connectivity in the mental lexicon is unclear and has been a controversial issue for decades. 
However, a better perception of how words are interpreted and organized in the human brain can be 
used to improve our understanding of the underlying cognitive processing, as well as create cognitive 
consciousness in the Natural Language Processing (NLP) applications. Many cognitive studies in 
various languages are being performed to explore the semantic and linguistic representation of words in 
the ML. In certain cases, semantic priming methods are used to conduct such research (Dehaene et al. 
1998, Neely 1977). Priming is the method of exposing a stimulus (called a prime) that allows a similar 
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stimulus to be noticed more easily (called the Target). For example, if the word "Banana" is 
accompanied by a similar stimulus like "Yellow," the amount of time taken to remember it would be 
lesser than if it is preceded by an unrelated stimulus like "Home." The data gained from such cognitive 
experiments are then used to construct distributed computational models of word representation and 
processing in the ML. There are several attempts to model mental lexicon in computational ways for 
representing semantically similar words. Among them, the distributional semantic representation model 
is one of the most widely used. The term "distributional semantics" refers to the classification of 
semantic correlations between words based on their meaning distributional properties (Harris 1954). As 
a result, semantic relatedness can be determined by comparing word co-occurrence behaviors in text 
corpora. This theory has been a valuable line of study in linguistics for the past couple of decades (Lund 
& Burgess 1996, Landauer & Dumais 1997). Thanks to recent advances in the field of deep neural 
networks, a more versatile version of distributional semantics models in the form of word2vec word 
embedding have been proposed. However, the majority of these works are done in English, French, 
German, Arabic, and Italian. There have been very few efforts to run such research for Bangla ML. 
Furthermore, an important feature of the analysis is determining if the distributional method to word 
representation is truly representative of how the human mental lexicon operates. 


In this article, we have studied how and whether traditional word embedding-based semantic 
representation models correctly reflect the human mental lexicon. The article focuses on the number of 
examples where extremely similar words found by computational models have been shown to have no 
priming effect on users. On another, word pairs with a low corpus similarity will lead to a great deal of 
priming. 

The main objectives of our article are discussed as follows: 


e We perform a semantic priming experiment on the basis of 300 word-pairs to study the 
structure and interpretation of Bangla words in the mind lexicon. 

e We have computed word embedding for each of the 300 Bangla word pairs and 
calculated the semantic distance for these word pairs. 

e Weare trying to see if there is a correlation between the computed cosine similarity and 
the priming effect. The null hypothesis is that words with high cosine similarity scores 
would expect a high degree of priming. 

The remainder of the article is set out as follows: The related work is found in Section 2. In section 3, 
the semantic priming experiment on Bangla words is clarified, and in section 4, the distributional method 
for measuring semantic similarity is defined. Section 5 contains the findings and discussion. 


2. Related Work 


One of the phenomena in cognitive psychology that demonstrates how well the human processes and 
represents semantically similar words in various languages, is the “semantic priming effect” (Neely 
1991, Lucas 2020). Initial experiments with semantic priming looking into prime-target word pairs 
examined the principle of word association: a term like "CAT" (the target) was found to appear 
considerably faster with a similar word (the prime, such as “DOG’) rather than a word that was unrelated 
(such as "KEY") in a decision-making task (Meyer & Schvaneveldt 1971). 


Semantic relationship comprises any relationship or the similarity in meaning between two 
terms (c¢.g., “car-bus”). The use of two stimuli that are usually correlated with one another is known as 
associative relatedness. For example, "spider" and "web" are two terms that are often connected in 
memory, so hearing one of them will prime the subject to respond more easily when the other appears 
(Postman & Keppel 2014). Early research into semantic or associative relationships, has demonstrated 
that associative relationships derive not only from semantic similarities but also the co-occurrence of 
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word pairs in the corpus (Fischler 1977, Fodor 1983). There are many words that are 
semantically related but still not associatively related (e.g., truck-van). Hence, we do not have a specific 
description of a "semantic relationship" (Neely 1991, McRae et al. 1997). 


In this article, we have concentrated on the subject of semantic priming in Bangla. We 
have carried out the priming experiment and analyzed results with the computation model to understand 
the semantic behavior between word pairs and to establish how priming of human cognition is 
significantly correlated to the computational model in the Bangla Language. 


3. Semantic Priming Experiment 


To examine the effects of priming on semantically related Bangla words, we have manually obtained a 
semantic similarity score for 300-Bangla word pairs from a group of 100 participants. We have requested 
the participant to give the scores from | to 10. A score of 1 indicates that the word pairs are semantically 
unrelated, while a score of 10 indicates that they are very similar or synonyms. The number of words 
belonging to each similarity score is depicted in Figure 1. 
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Figure 1: Graphical representation of No. of words concerning each similarity score. 


In Figure 1, the bar chart indicates the sum of word pairs in various semantic similarities, for 300-word 
pairs. The participant is given the word pairs to choose the data set. Based on the above dataset, we have 
performed the masked priming experiment as discussed in (Forster & Davis 1984, Rastle et al. 2000). 
In this system, the prime word (for example, Bidyalaya (School)) is squeezed between a forward pattern 
mask and the target stimulus (for example. Chatra (Student)), which serves as a backward mask. This is 
explained here. 


prime (1000ms) Bidyalaya (SCHOOL) — target (2000ms) Chatra(STUDENT) 


The participants are asked to determine if the given target word is true or not after the target word has 
been introduced to them for a set amount of time. Participants make their decisions by clicking the keys 
on a regular QWERTY keyboard marked 'T' (for true words) and 'F' (for invalid words). The system 
timer tracks the time taken to press one of the keys after the target word is shown (also known as the 
reaction time (RT)). Once again, we display the same target word but with a separate visual probe called 
the CONTROL word. The CONTROL word does not demonstrate any semantic or orthographic 
correlation with both the prime and target word. For example, Gicha(tree) and Sakha (branch) is a 
prime-target pair, and the corresponding control-target pair could be Katha (talk) and Sakha (branch). 
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The experiment was carried out with the help of a graphical user interface (GUI) called DMDX’, which 
randomized the prime-target pairs. 


3.1. Dataset and Experimental Setup 


The 300-word pairs are hand-made from the corpus of Bangla about 3.2 million unique words. The 
corpus consists of popular Bangla writers' writings; we have also added texts from Bangla Wikipedia, 
blogs, and news sources to the corpus. The terms are selected in such a manner that they represent a 
significant amount of dissemination across the whole corpus. This would also be useful for building the 
word embedding that will be used to compute semantic similarity. Table 1 shows the statistics report of 
the dataset. 


Table 1. The statistics of the selected word pairs. 


Term or word type Avg. TL Avg. no. of CC Avg. TF 
Semantically related 6 2 38 
Semantically not related 5.8 1.6 32 


From Table 1, we have depicted the average length of the word or term (Avg. TL) as six, the 
average complex character present in the word (Avg. no. of CC) as two, and the average corpus 
frequency of the word (Avg. TF) as thirty-eight for semantically related word. The semantically not 
similar words have the Avg. TL as five-point eight, the Avg. no. CC as one-point six, and Avg. TF as 
thirty-two. 

A small exercise about the experiment was given to all of the participants before the start of the 
true experiment. The ten pairs of words were selected differently for trial examination. Our experiment 
was conducted in two phases. The experiment process was the same at both phases, except for the word 
pairs. Each phase takes around 12 minutes. Within these two phases, there was a break of at least 2 
hours, since a continuous sitting of 12-minute took a considerable amount of concentration and was also 
difficult for the participants. Again, we divided each experimental phase into four sub-settings of three 
minutes each, with a two-minute break in between. 


3.2. Implicit insight into semantic similarity 
3.2.1. Methods and Material 


We divided the 300-word pair collection into two groups. Group-A defined the semantically 
related word pairs, and Group-B defined the unrelated word pairs. The table below shows the illustration 
of the two groups. 


Table 2. An example of two groups 


Group Explanation Example 


Group-A Words are related Wifal(Majhi(boatman)) ----- CoVI<PI(Nauka(boat)) 
Group-B Wordsare not related — (>I (Saurabha(Saurabha)) ------ CaI<PI(Nauka(boat)) 


: http://www.u.arizona.edu/kforster/dmdx/download.html 
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Group-A primes have a high degree of relatedness (e.g., Mfal(Majhi(boatman)) -— CBT 
(Nauka(boat))), whereas Group-B primes have a low degree of relatedness (e.g., Ofte 
(Saurabha(Saurabha)) -- CATPI(Nauka(boat))) 


The controls are semantically, orthographically, and morphologically unrelated to the target 
word. As previously said, the participants make lexical judgments depending on the visual probe 
representation. As a result, it's critical to limit the participant's ability to make any strategic guesses 
about the relationship between pairs of words. To avoid strategic guessing, we insert a filler word 
between the literal pairs of words, which are out of dictionary words such as non-words. 


Thus, 600 words pairs of 150-word pairs of Group-A, 150-word pairs of Group-B and 300-word 
pairs of fillers (e.g., *TVYOT (Panas6) -- APIA (Parasana)) are used in the priming experiment. Next 
taking the word pairs, filler words, we distribute the word pairs haphazardly. Then 60-word pairs in 1 
set have been chosen, i.e. 600-word pairs have been chosen in 10 sets. Before beginning the trial, another 
sample of 10-word pairs was chosen for the practice session. The RT of these practice pairs shall be 
exempted from all evaluations in this article. 


3.2.2. Participants 


The tests were carried out on 100 native Bangla speakers with 50 of them pursuing a graduate degree 
and 50 of them belonging to post-graduate standards. The participants range in age from 20 to 30 years 
old. They had perfect vision and no issues with reading. 


3.2.3. Results 


The lexical analysis was not incorporated for the incorrect responses and higher reaction time of the RT 
for the lexical decision (approx. 7.5%). The higher reaction time for each participant is the median 
lexical latency of that essence participant. The average response time (RT) of certain prime-target and 
control-target word pairs is shown in Table 3. 


Now, we compute the inter-annotator agreement (Fleiss Kappa) measure to identify the valid 
data provided by the participants (Fleiss et al. 2013). The value of the inter-annotator agreement is 0.68. 
Here we compute the Degree of priming (DOP) i.e. the average response time (control-target) minus the 
average response time (prime-target) word pairs for each word pair. This is expressed as follows: 


Degree of Priming = RT(C_T) — RT(P_T) (1) 


Table 3. The average response time (RT) of prime-target and control-target word pairs. 





Prime word Target word Control word Average RT Average RT 

(PT) (CT) 

Cola VIC) Ort 548.62 786.89 
Ce canst wa 636.71 716.81 
can carat ary 700.14 821.81 
wel ay aa 612.81 726.96 
fabra faba Gal 669.99 809.42 
wall Wale] IV) 641.97 741.99 
acta atoll ICICI 685.80 780.41 
farm fama ar 748.60 820.14 
ho Ty Bw 594.16 645.64 
BIG AON apy 737.95 773.69 
WY SRE ATSC 605.70 647.03 
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qd CEP yy 611.02 746.99 
ay BIA eth 613.79 736.87 
we Car ay 593.32 729.14 
AGU atl a 689.02 833.11 

BS QIN 614.77 786.92 


We measure the average DOP among all participants after obtaining the DOP result. 


4. Semantic similarity measure using Distributional Approach (Word2vec Model) 


Word embedding is a form of word representation that helps machine learning algorithms to recognize 
words with similar meanings. It is a neural network-based translation of words into vectors of real 
numbers or dimension reduction on the word co-occurrence matrix. It is a method for language modeling 
and feature learning. Word embedding is a neural network-based mapping technique. Word embedding 
models such as word2vec (Google), Glove (Stanford), and fastest (Facebook) are accessible. The 
concept "word embedding" is often used to refer to a distributed semantic model. 


Word2vec model is to generate word embedding for improved word representation. It describes 
syntactic and semantic word relationships with great precision. It is a two-layered shallow neural 
network. The only hidden layer between input and output is the shallow neural network like a deep 
neural network. At Google, a team of researchers led by Mikolov et al. (2013) created Word2vec. 
Word2vec outperforms the latent semantic analysis model in terms of quality and efficiency. Word2vec 
employs two different architectures: a) Continuous bag of words (CBOW) b) Skip-gram. In CBOW, the 
actual term is predicted by looking at the context windows around it. For example, let Wi2, Wi-1, Wi+1, 
Wi+2 are given context, this model will provide wi. In the skip-gram model, prediction of the context is 
performed from a word. For example, let wi be given, this will predict the context Wi-2, Wi-1, Witt, Wi+2. 
One Hot Encoding is a standard method of preprocessing categorical features for machine learning 
techniques. This method of encoding introduces a new binary function for each possible category and 
applies a value of | for each sample corresponding to its original category. We use the one-hot encoding 
of the input word and calculate the output error against the one-hot encoding of the target word. Thus, 
the vector representation of the target word is learned by learning to predict the same target word. Figure 
2 shows the fundamental architecture of the CBOW model. 
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Figure 2: Word2Vec Model Architecture. 
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Now, the word2vec model takes each word from 300 prime-target and control-target pairs as input and 
calculated embedding for the same. In this article, we have trained the word2vec model to create 
embeddings with different dimensions on the corpus discussed in section 3.1. (Model A), and we have 
also used some pre-trained word2vec models, namely Model B’, and Model C°. Using the cosine 
similarity matrix, we find out the semantic similarity score between each of the word pairs based on the 
various sets of embedding. 

Wi. Wj 


we Tl wy ll @) 


cos cos (w;,w;) = 


The scores were then compared to the DOP obtained during the priming experiment. The correlation 
between the various embedding models and the response times acquired from the priming assay is seen 
in Table 4. 


Table 4. A comparison study between different word representation models with human perceptions. 





“Modelname = =VOCAB-~—SC@Dimension’=——sCCorelation = 
Model A 427261 300 0.2422 
Model A 427261 400 -0.2104 
Model B 10059 200 -0.0994 
Model C 145350 300 0.0203 


We can notice from Table 4 that the Word2Vec models and the Reaction Time have a very weak 
correlation. 

Table 5 explains two forms of discrepancies between the word2vec similarity score and the 
priming effect (or the similarity experienced by humans as reported in psycholinguistics experiments): 
Some words have a high degree of priming but a poor word2vec similarity score, and others have a low 
degree of priming but a high word2vec similarity score. 


Table 5. The similarity score of human perception (DOP) and Word2Vec annotated. 


ee 
mead Rencd ca) word oe of priming 


| 238.28 28 | 0.3236 _| 3236 
194.61 0.2506 
127.10 0.3669 
309.81 0.2678 














2 https://github.com/Kyubyong/wordvectors 
2 https://fasttext.cc/docs/en/crawl-vectors. html 
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fags | ee DRG 131.00 0.2584 





It is evident from the kinds of word pairs that we use together in our everyday lives and colloquial use, 


such as “CBI” and “WIPO”, We also encourage a close mental bond between them, as shown by the 
high DOP. On the other hand, these pairs of terms are less likely to co-occur in a formal written corpus, 
they have a low score in the word2vec model. Some word pairs are morphologically related, such as, 
“(HT and “CHAI” show a high degree of priming, but low similarity score measure in the word2vec 
model because they do not frequently co-occur in the corpus. Again some word pairs like “bI¢T’ and 
“let”, which have low dop but high similarity found by the distribution word-embedding model, 
because they are different in meaning and shape but co-occur frequently in most of the corpus. Some 
human characters like “ATN” and “dId¥” show a high degree of priming, but low word2vec similarity 
score because the human brain can capture the relation between these types of word pairs from the 
listening childhood story. As well, some word pairs like “OY” and “<1” have a low degree of priming 
but have high word2vec similarity score; because these types of word pairs co-occur frequently in the 
corpus but they are different as “OY” is a reptile and “<1” is an amphibian. Another type of word 
pairs like ““Ta1§” and “P12” have the same low degree of priming but high semantic similarity score 
because the word “*YA&” is the name of the season and “f*{f*12” is a thin film of water. They have 
different concepts but frequently co-occur in the written corpus. Again some word pairs “Ql” and 
BQ” have low degree of priming and high similarity. These word pairs are different in their 
meaning but co-occur frequently in the context. Similarly, other word pairs such as “91PT and “ATO” 
have high cosign similarity score as they are likely to co-occur in the written corpus, but in daily use, 
not so many. This pair of words reflects the concept of categorization. As a result, our null hypothesis 
proves to be incorrect, and we may conclude that there is already a significant difference in how our 
brain process words and how computational methods believe we do. 


5. Conclusion and Discussion 


This paper aims to explore how the Bangla semantically related words are interpreted and processed in 
the mental lexicon. We have subsequently carried out a semantic priming experiment in order to assess 
the response time of participants for prime-target and control-target word pairs. In addition, we measured 
the semantic similarity of the Bangla word2Vec based word embedding model for the same word pairs. 
We contrasted whether traditional word embedding-based semantic representation models accurately 
reproduce the mental lexicon's structure and processing. According to the response time analysis shows 
that use of corpus-based semantic similarity methods does not demonstrate the true nature of mental 
interpretation and retrieval of words. The paper examines some cases in which it has been shown that 
extremely related words have no priming impact on users, while word pairs with low corpus similarity 
may have a high degree of priming. 
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The current word embedding models are dependent on the underline corpus on which they were 
trained. As a result, to comprehend word representation procedures, the corpus should better resemble 
the human spoken form. However, this ideal condition must have to be approximated. 


In fact, much of the natural language humans are presented with belongs to the form of spoken 
type. The digital transcription of the spoken forms into their corresponding textual versions is needed to 
use such details. It doesn't only take time but requires an immense amount of human resources. On the 
other hand, current available standard textual models are based on the written language that is freely 
accessible on the internet. While they are available in large quantities, they are often less representative 
of the actual language input. A recent paper by Brysbaert et al. (2011) revealed that the word frequency- 
based corpus has been better predicted than the Google frequencies-based corpus in the lexical decision 
project (English Language) ( Balota et al. 2007). As well, the German language has reported the same 
(Brysbaert et al. 2011). In special, language frequencies derived from non-fiction, academic texts work 
less well (Brysbaert et al. 2012). 
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Keywords Abstract 

Numeral Classifier, Classifier, The Bhojpuri Numeral Classifier <e’/ <t'o’/-go’, varying regionally, 
Bangla, Bhojpuri, Classifier- attaches with a numeral and denotes a sum of homogeneous nominal 

less DPs, Classifier- entities, irrespective of their gender, animacy, shape or size. The paper 
compatible DPs, Specificity explores three kinds of environments on the basis of presence or absence of 


the Classifier ‘re’; firstly, the environments where the Classifier has to be 
necessarily attached with the Numeral; secondly, the cases where the 
Classifier is blocked by Measure words, Calendrical terms or units and has 
to be dropped in order to construct acceptable sentences and the third 
environment where the Classifier is optional. The optionality in the third 
environment is addressed in the last section where it is observed that the 
Classifier-compatible DPs differ from the Classifier-less DPs in terms of 
Specificity. However it could also be a result of regional variation or under 
the influence of a dominant non-Classifier language, there is a gradual loss 
in the Classifier forms of Bhojpuri. 


1. Introduction 


Amongst the several unofficial languages of India which are not yet listed in the Constitution 
as ‘Scheduled’ Languages, Bhojpuri deserves a special mention. Bhojpuri is considered as a 
dialect of Hindi for years due to its socio-cultural history although it shares a lot of linguistic 
resemblance with Bangla. 


The language is widely spoken in States like Bihar, North-west of Jharkhand along with 
regions of Uttar Pradesh. It is also used in some parts of Assam, Delhi, Madhya Pradesh and 
West Bengal. Although Bhojpuri is a well-acclaimed language in India, there exists a lot of 
scope for its linguistic research as very less investigation has been carried out about the 
language. 


Classifiers are inflectional affixes that categorise nominal entities under distinct classes on 
the basis of shared characteristics. Classifiers are attached directly to a noun only when they 
denote a unique object. When denoting a sum of homogenous objects a classifier is attached 
to a numeral (definite or indefinite). A numeral plus a classifier constitute a numeral classifier 
in which the numeral gives a number of classified objects. 


Bhojpuri DPs exhibit an extremely productive Numeral Classifier that remains unaccounted 
for in the existing literature of Bhojpuri linguistics. The Bhojpuri Numeral Classifier ‘re’ or 
‘-t"o’ or ‘-go’ varying regionally or phonetically, exhibits interesting properties that are not 
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yet recognised. The Numeral classifier attaches with a numeral and represents a sum of 
homogeneous nominal entities, irrespective of their gender, animacy, shape or size. 


(1) du-t'e laiki aail hain 
two_CL girl come.be.PRS 
Two girls have come. 


(2) Sita-wa car- t'e kitaab laile ha 
Sita-? four-CL book bring.PRS 
Ram has brought four books. 


However, it is observed that the Numeral Classifier behaves differently for different 
environments. Since the Numeral Classifier is not linguistically explored yet, it intrigued us 
to investigate more about the properties of the Classifier and how it behaves in varied 
situations. 


In this paper, three different environments are identified on the basis of presence or absence 
of the Bhojpuri Numeral Classifier ‘-¢'e’ or ‘-t"o’ or ‘-go’. The paper is divided into three 
parts where the first part documents the constructions in the Numeral Classifier Compatible 
environment, the second part points out the constructions where the Classifier is unacceptable 
along with explanations that justify its absence and the third part decodes the reasons for its 
optionality in some structures and the difference in their interpretations. The paper also 
discusses cross-linguistic references that further validate the analysis. 


1.1.Literature review 


As mentioned before, enough linguistic documentation of the Bhojpuri Classifier was not 
available in the literature. However, the brief comparison of the Bhojpuri Numeral Classifier 
“-t"e’ or ‘-t'o’ or ‘-go’and the Bangla Numeral Classifier ‘-ta’ in Mukherjee (2021) served as 
a significant source of data and various properties and behavior of the Classifier in different 
syntactic constructions which helped us a lot in this particular study. As mentioned in 
Mukherjee(2021), the Indefinite structure of the Bangla Cardinal DP bearing the Numeral 
Classifier ‘-ta’ exhibits properties that are similar to the structure of the Bhojpuri Cardinal DP 
bearing the Numeral Classifier “7’e’. 


The structures of the DPs in both the languages are given below: 
Bangla: 
Numeral-Classifier-Noun 


du-to lok 
Two-CL men 
Two men 
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DP 
\ 
D’ 
ae 
e 
du CL NP 
(two) -to lok 
(CL) (man) 
Bhojpuri: 
Numeral-Classifier-Noun 
du -t'e admi 
Two-CL man 
Two men 
DP 
D’ 
D CLP 


du CL NP 
(two) -te admi 
(CL) (men) 


The Bangla and the Bhojpuri Numeral Classifiers ‘-ta’ and ‘-t'e’ respectively both reveal 
atomic properties denoting a count of the distinct nominal entities with the help of the 
number preceding it. The only difference in the aforementioned structures is that the Bangla 
construction in the order Numeral-Classifier-Noun denotes the indefinite DP structure of 
Bangla whereas, for the definite DP construction the word order changes to Noun-Numeral- 
Classifier where the Noun is raised to the Spec of DP; but in Bhojpuri both the definite and 
indefinite structures bearing the Numeral Classifier ‘-t"e’ are expressed in the same order 
Numeral-Classifier-Noun and no raising of the Noun is witnessed. 


As it was intended to explore the three kinds of environments of the Bhojpuri Cardinal DPs 
based on the occurrence of the Numeral Classifier ‘-t’e’, we tried to observe the behaviour of 
the Bangla Classifier ‘fa’ in similar environments of Bangla Cardinal DPs in order to 
develop a concept regarding its nature. It was found that the Bangla Classifier ‘-ta’ is 
compatible with almost all classes of Noun irrespective of its gender, shape, size, antmacy 
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etc. Bhojpuri also acts in a similar way and more detailed study of the same is given in the 
following sections. But when the Noun in the Cardinal DP is a measure word or a Calendrical 
NP, the Classifier is necessarily dropped to maintain acceptability of the phrase or sentence. 
Das Gupta(1983) pointed out that measure words or Calendrical words do not allow the 
presence of a Classifier with the Numeral preceding it. Similar terms in Bhojpuri are 
experimented with in the following sections and similar results are observed. The third 
environment that focuses on the Cardinal DPs which may or may not possess the Numeral 
Classifier is not yet explored in Bangla. Three possible explanations for the third case are 
mentioned in this paper for Bhojpuri although more extensive research would be required to 
develop a generalised theory about it. 


1.2.Research Objectives 


Although in most constructions, the Classifier is necessarily present to produce acceptable 
sentences (like in | and 2), there are certain constructions where it is necessarily dropped or 
is optional. The following examples are quoted from Mukherjee(2021). 


(3) Gita ek bori chawal de gail biva 
Gita one sack rice give be.PST 
Gita has given one sack of rice 


(4) Raj c'e maihina pahile eha rahat rahe 
Raj six months ago here stay be.PST 
Raj used to stay here six months ago 


In the above two constructions, the presence of the Classifier is not allowed as it leads to 
unacceptable sentences. Moreover, if the following sentence is observed, it is found that the 
Classifier may or not be present here. 


(5) Ram du(-t'o) jagah ghume gayal rahe 
Ram two(-CL) place visit go.PST 
Ram went to visit two places. 


Although in (5) both the constructions with or without the Classifier sounds acceptable, a 
difference in interpretation between the two forms were pointed out by the native speakers. 


Thus the objective of this paper is to explore all the three environments. The paper attempts 
to document the environments where the Classifier is acceptable, the situations where it is not 
along with justifications for its absence and also the reasons for its optionality in the third 
environment. 


1.3. Research Questions 


Considering the aforementioned objectives, the following research questions will be 
addressed in this paper: 
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a. In which environments do the Numeral Classifier -¢’o necessarily occur without any 
exceptions? 

b. Which category or set of Nominal entities block the Numeral Classifier and why? 

Where is the presence of the Numeral Classifier optional? 

d. What difference in interpretation is realised in between the Classifier-compatible 
Cardinal DPs and the Classifier-less Cardinal DPs? 


7 


2. Discussion 


2.1 Classifier-Compatible DPs 


The Numeral Classifier ‘eo’ in Bhojpuri occurs regularly in most Cardinal DP constructions 
regardless of their gender, animacy, shape or size. The Classifier directly attaches with the 
Numeral like an affix and refers to an atomic set of classified entities. The structure of the 
Classifier-compatible DP is Numeral-Classifier-Noun. 


Let us discuss some properties of the objects on the basis of which Classifiers generally 
categorise Nominals and observe how the Bhojpuri Classifier ‘-¢’o’ behaves with respect to 
each of them: 


Firstly, animacy is an important property that categorises objects under different classes. 
Animacy can be further subcategorised as Human/Non-Human. Let us observe the following 
examples: 


2.1.1 Animate Noun: 
2.1.1.1 Human: 


(6) Ek-t'e laiki aail hain 
one_CL girl come.be.PRS 
One girl has come. 


(7) Tin-t'e admi baithal hauan. 
Three-CL men sit be.PRS 
Three men are sitting. 


The Classifier is compulsory with all Human Nouns like in sentences 3 and 4. 
2.1.1.2 Non-human: 
(8) Pac- t'e kukkur bhaukat rahalan. 

Five-CL dog bark be.PST.PROG 

Five dogs were barking. 
(9) Hamre ghar-e car- t'e gaiya hav 

my house-LOC four-CL cow be.PRS 


I have four cows at home 


In case of Non-human animate Nouns as well, the Numeral classifier occurs regularly 
without exceptions. 
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2.1.2. Inanimate Noun: 


(10) du- te kursi lava 
two-CL chair bring.PRS 
Bring two chairs. 


(11) hum Sita-ke car- te pen deni 
I Sita-DAT four-CL pen give.PST 
I gave four pens to Sita. 


The Numeral Classifier ‘-¢’e’ is attached with all Inanimate Nouns similarly as found in the 
above sentences. 


Thus it is observed that ‘-¢"e’ is compatible with all kinds of Animate or Inanimate 
Nominals and has to be necessarily attached with the Numeral preceding them to ensure 
grammatically acceptable sentences. 


The inanimate referents can be classified according to their Shapes. The different shapes 
that most commonly distinguish noun classes cross-linguistically are round, broad and flat, 
thin and long etc. Let us observe objects representing each of these properties and find out 
whether the Classifier is present in every case: 


(12)  hum-ke ek-t"e genda da 
I-DAT one-CL ball give.PRS 
Give me a ball. 


(13) Riya car-t'e kitaab kharidni 
Riya four-CL book buy.PST 
Riya bought four books 


(14)  Upehalvan ek-t'e lathi tor delas 
That bodybuilder one-CL stick break do.PST 
The bodybuilder broke a stick. 


As it is witnessed in the examples, the Classifier occurs regularly with differently shaped 
inanimate objects. 


A further objectively perceivable property of an object is its Size. The focus would be on 
the two broad size distinctions only, which are Large and small. 


(15) Ajrasta me hum du-t'e hathi dekhni 
Today road LOC I two-CL elephant see.PST 
I saw two elephants on the way today 


(16) Jerry ek-t'e chuha hav 
Jerry one-CL mouse be.PRS 
Jerry is a mouse 
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Thus as the examples suggest the Classifier is essential in each of these constructions that 
proves that ‘-t"e’ occurs with large as well as small sizes and cannot be dropped at any 
instant. 


Therefore as observed from all the above constructions the classifier is essentially attached 
with the Numeral preceding the Nouns in all these Cardinal DPs without any exceptions 
irrespective of its animacy, shape or size. It should also be noted that the Numeral classifier is 
compatible with both feminine and non-feminine genders as well from sentences 3 and 4 
respectively. 


As of now, we find the Bhojpuri Numeral Classifier ‘-t"e’ displaying resemblance with the 
Bangla Numeral Classifier ‘-ta’ and both occur uniformly with every gender, shape, size or 
animacy. Bhojpuri, just like Bangla, does not interpret singular or plural number marking on 
the nominal. Nevertheless there exist a few exceptions in between the languages. No word 
order reversal between the Numeral-Classifier cluster and the Nominal is allowed in Bhojpuri 
to express definiteness like in Bangla. One more distinction is that the Bangla Classifier ‘-ta’ 
can be directly attached with Nouns (also other categories in special cases) but the Bhojpuri 
‘the’ is purely a Numeral Classifier which is only compatible with Numerals and nothing 
else. However, in the next sections more similarities in between the two languages would be 
observed. 


2.2 Classifier-less DPs: 


Typologically, in Classifier languages, the cardinal terms are not directly linked with the 
noun but a numeral classifier has to intermediate in between the two, irrespective of the 
singular or plural reference of the NP. However there is a considerable number of DPs in 
Bhojpuri which comprises cardinals and nominal expressions but lacks classifiers. The forced 
addition of a Classifier in these constructions sounds odd or unacceptable to native speakers. 
These DPs are named as Classifier-less DPs. 


The following are examples are taken from of Classifier-less DPs! in Bhojpuri: 


(17) Gita ek bori chawal de gail biya 
Gita one sack rice give be.PST 
Gita has given one sack of rice 


(18) Raj c'e maihina pahile eha rahat rahe 
Raj six months ago here stay be.PST 
Raj used to stay here six months ago 


(19)  Waha do kilo ata rakhal hav 
There two kilogram flour keep be.PRS 
Two kg of flour is kept there. 


(20) Jtin din pehle ka sabzi hav 
This three day before GEN vegetable be.PRS 
These vegetables are three days old. 


' The concept of Classifier-less DPs with a list of examples in both Bangla and Bhojpuri is explained in 
Mukherjee (2021). In order to understand further details about each category of the Classifier-less DPs of 
Bhojpuri, the mentioned paper should be consulted. 
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(21) Udu botal pani pi lelas 
He two bottle water drink do.PST 
He drank two bottles of water. 


(22) Hum do feet lathi kharidli 
I two feet stick buy.PST 
I bought a two feet long stick. 


If noticed closely, it is found that Nouns in examples (19) and (22) are units. Examples (17) 
and (21) are referring to some kind of measuring standards. These are known as Measure 
words. Measure words are generally used to measure uncountable or mass nouns that cannot 
be individuated atomically like countable nouns. Examples (18) and (20) are terms that are 
usually found in a Calendar. Hence they are known as Calendrical Nouns. Other examples of 
calendrical nouns are days, weeks etc. 


Das Gupta points out some of these DPs which bear cardinals as well as NPs but no 
classifiers in Bangla. He distinguishes a class of NPs (like the ones mentioned in the previous 
examples) as measure words or calendrical NPs. In Dasgupta (1983), the author mentions that 
“measure words” do not normally take classifiers, except to achieve certain referential or 
contextual effects. Bhojpuri which shares linguistic resemblance with Bangla, also disallows 
occurrence of the Numeral Classifier ‘-’o’ with units, measure words and calendrical words 
as observed in the above data. Measure words and Calendrical terms take up the same 
syntactic position like the Classifier. Hence the Classifier has to be dropped as its position is 
blocked by the former. 


DP DP 


Num CL Noun Num Measure Noun 
Words/Units/ 
Caldendrical 
Terms 


Basically the function of a Classifier in a Classifier language is to count the atomic 
individuals of a set and the function of a measure word is to measure the quantity of 
uncountable mass nouns or sometimes countable nouns which are treated as a mass instead of 
discrete atomic entities. 


Thus it can be concluded that all Calendrical terms, measure words and units in Bhojpuri 
lacks classifiers. Although it may be argued that in some special cases of emphasis or focus, 
the Classifier could be forcibly added to the DP, but it would still sound odd and 
unacceptable in regular speech. 


2.3 Optional Environment: 


The third and the most imprecise environment of the Numeral Classifier ‘-the’ 1s when it can 
be optionally attached or dropped in a given DP with the Numeral preceding the Nominal. 
Although it is optional the two forms i.e, one with the Classifier and the other without the 
Classifier express slight semantic difference as accepted by the native speakers. The third 
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section of this paper seeks to catch this semantic difference and propose a tentative 
explanation for the same. 


(23)  ajtin(-t'e) jagah-se citthi milal 
today three-CL place-ABL receive 
Today I received letters from three places. 


In the above sentence, the Numeral Classifier may or may not be added and in both cases 
the sentences are acceptable or grammatical. However, the native speakers admitted to 
intuitively understand some difference in interpretations in between the two forms of 
constructions. There could be three probable explanations behind this. 


Specificity: In the specific interpretation of a Noun Phrase, the speaker has a particular 
individual/ object/ group of individuals/ objects in mind, whereas in the non-specific reading 
he/she does not. For Example: 


i) Ritu consulted a doctor every month. 


(It could be a particular doctor who she consulted every month and the speaker knows who it 
is.) 


ii) Ritu consulted a doctor every month. 
(There may have been a different doctor every month and the speaker does not know them.) 


In (4) [a doctor] has a broader scope with respect to [every month] that expresses that there 
is only one doctor that she visits, whereas, in (11) [a doctor] has a narrower scope with respect 
to [every month] which expresses that ‘a doctor’ 1s not specific and it may vary every month. 


In English as seen in the above examples, Specificity could be identified in a comparatively 
easier way by scope difference. However such is not the case for Indo-Aryan Languages. 


Epistemic Specificity expresses the contrast between the speaker’s knowledge and 
speaker’s ignorance (or indifference) about the referent. 


Now let us consider the following sentences: 


Table 1. Example Sentences 
(24) Ram du-t'o jagah ghume Speaker knows the name of the Specific 
gayal, France or England places 
Ram went to visit two places, 
France and England 


(25) Ram du jagah ghume gayal, Speaker does not know about the | Non-specific 
kaun kaun nahi pata name of the places 
Ram went to visit two places, I 
don’t know the places. 





It can be tentatively established from the given data that classifier-compatible cardinal DP 
differs from the classifier-less cardinal DP in terms of Epistemic specificity. In 24, the 
speaker has knowledge about the names of the particular two places where Ram has gone and 
hence expresses a specific interpretation. But in 25, the speaker is ignorant about the name of 
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the places and thus a non-specific meaning is expressed. That justifies the slight difference 
the natives were intuitively observing. 


The second explanation involves interplay of sociolinguistic factors that have made the 
native speakers doubtful about their own mother tongue. It is noticed while collecting and 
discussing the data with some informants that dropping the classifier at certain constructions 
could also be a result of some official dominant language (presumably Hindi) influence that 
the native speakers had to learn in school/work environment or even at home as Bhojpuri was 
never the medium of instruction even in primary education due to the stigma attached with 
‘non-standard’ languages in India. The informants themselves admitted the same. Hindi is a 
non-Classifier language and it could be a possible case where due to the impact of Hindi in 
the native speakers’ lives and lack of regular usage of their mother tongue, Bhojpuri, in 
comparison with Hindi, they are losing their traditional Bhojpuri Classifier constructions. 
This brings us to very serious questions about minor language/ dialect depletions under the 
pressure of the major so-called standard languages in the near future. 


The third explanation for the optionality could be for the wide geographical extents of the 
language. Although Bhojpuri is not yet recognized as one of the scheduled languages in 
India, it is commonly spoken in States like Bihar, North-west of Jharkhand along with Uttar 
Pradesh's Purvanchal region and its suburbs. It is also used in some parts of Assam, Delhi, 
Madhya Pradesh and West Bengal. Several dialects have evolved from Bhojpuri language 
including 'Northern Standard Bhojpuri (Gorakhpuri, Sarawaria, Basti), Western Standard 
Bhojpuri (Purbi, Benarsi), Southern Standard Bhojpuri (Kharwar1), Tharu, Madhesi, Domra, 
Musahari etc. As a result of its widespread, the language has developed various regional 
variations. The phonetic variations are evident but the morphological, syntactic and semantic 
variations are yet to be identified. The study we conducted is at its preliminary stage and as of 
now data was collected only from the Purvanchal region of UP. So it can be assumed that in 
sentences 23, 24, 25 the presence of the Classifier is compulsory in some regional dialect or 
variation and unacceptable in others. 


However, to develop a confirmed theory in order to justify the optionality of the Numeral 
Classifier in certain constructions, further exploration and research needs to be conducted. As 
of now, the list of Nouns that are allowing the optionality are not yet confirmed. The list 
might provide more clarity about the operations taking place here. Not only in Bhojpuri but 
also in the neighbouring Classifier languages like Bangla, Assamese etc. such constructions 
exist and needs to be accounted for, so extensive cross-linguistic and comparative studies are 
required to be carried out in the Classifier languages to ensure a comprehensive theory. 


3. Conclusion 


On the basis of presence or absence of the Numeral Classifier ‘-the’ in Bhojpuri, three kinds 
of environments in the language are pointed out. The first one is when the Classifier has to be 
compulsorily added with a Numeral preceding the Nominal entity to produce an acceptable 
sentence. Here, the Classifier occurs uniformly in every Cardinal DP irrespective of the 
gender, animacy, shape or size of the Noun without any exceptions. The second one is when 
the Classifier is necessarily dropped in order to produce acceptable sentences. Measure 
words, units or Calendrical terms blocks the syntactic position the Numeral Classifier and 
hence, addition of the Classifier sounds strange in such constructions. The third and the most 
interesting environment is when the Classifier could be optionally present in certain sentences 
but there lies a difference in interpretation in the two forms. Three probable justifications for 
the difference are either the classifier compatible DPs differ from the Classifier-less DPs in 
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terms of specificity or due to influence of a co-existing dominant non-Classifier language, 
some speakers might have lost the classifier constructions or it could be a matter of regional 
or dialectal variation of the language as it is spoken in many parts of the country. However, 
further research is required to develop a comprehensive theory about the same. 
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Keywords: Abstract 

emerging models of second 

language pedagogy, With the continuous efforts of the researchers across the globe, there are 
ELT scenario, a series of methods and approaches designed so far based on various 
CLT, psychological, socio-cultural, and linguistic principles for bettering 
Odia mediumeducational second language pedagogy. In this way, some of the emerging modelslike 
institutions, 


Evidence-Based Second Language Pedagogy, Post-method Pedagogy and 
Exploratory Methods of Second Language Teaching with eclecticism 
happen to be the contemporary practical and scholarly outcomes and are 
considered to be the grounds for designing syllabus and evaluating the 
teaching and learning of second language. In this way, the present paper 


classroom teaching, 
proficiency assessment 


focuseson the contemporary ELT scenario in the state of Odisha taking 
into critical consideration of the parameters of the pedagogies outlined 
above. Firstly, it gives an overview of the contemporary models of 
pedagogies of second language. Then, it focuses on the ELT scenario in 
Odisha at primary, secondary and tertiary levels which has been mostly 
based on some traditional methods and recently Communicative 
Language Teaching in which output is not yet satisfactory. Then, it 
presents the problems and possibilities associated with the teaching- 
learning system in the Odia medium educational institutions and examines 
if such models are in use or not. The study includes the second language 
learning outcomes on the basis of the observations of the current Odishan 
socio-cultural, linguistic environments, classroom teaching as well as 
proficiency assessment of the students at different levels and provides a set 
of recommendations. 


1. Introduction 


The state of Odisha has demography of around 4.5 billion people. People speak standard Odia 
(in the coastal districts), and its respective dialects like Ganjami or Dakshini Odia (southern 
Odisha), Samabalpuri, or Koshali (western Odisha), Baleswari Odia (northern Odisha). There 
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are many speakers of tribal languages such as Santhali, Ho, Sora, and Kui. They constitute 
22.85% of the total population of the state. When it comes to learning English, the mother 
tongue (MTI) influence creates the problem. On an average, some of the general 
psychological, socio-cultural, and pedagogical factors influencing the rate of learning a second 
language in Odisha are listed below: 


i Age and time of entry into the second language learning environment 

il. Personality and learning style (Oxford & Anderson, 1995) 

ill. Attitude and motivation to learn the new language (Dornyei, 1990) 

iv. Previous exposure to and experience in the second language and the new culture 
(Kiatkheeree, 2018) 


V. Teaching Methodology/Training mostly adopted from the so-called methods and 
approaches. 

Vi. Lack of sustained efforts to improve teaching and to learn in English 

Vil. Mere listening to the teacher in the class makes the students bored. 


vill. | English is still treated as a subject to be taught, not to be learned. 
An overview of ELT in Classrooms in Odisha can be presented in the following manner: 
I. At the Primary Level 


i. Students start learning the alphabet of Odia and English simultaneously. 

ii. Nobody makes them aware of the sounds and the phonics properly. 

ii. In conversations in both the languages, stress, and intonation are not clearly emphasized. 
iv. Made to memorize the meaning of words; sentence structures; rules of grammar with some 
practice 


v. Mechanically reproducible answers to the questions are encouraged. 

vi. No additional general and socio-cultural and creative activities are conducted or carried 
outto improve general conversational skills as outlined in the parameters of the essential 
interpersonal skills. 


II. At the Secondary Level 


i. The textual gravity in English based on detailed and non-detailed studies remains pressure, 
and therefore, students mostly try to memorize the fact and/or style-based answers of both short 
and longtypes. 


ii. For vocabulary, grammar, and composition, practice sets are done, or maybe rules are 
selectively memorized. 


iii. The main focus of learning and improving fluency and proficiency in English language 
skills is greatly ignored. 


IH. Tertiary Level 

i. Focus is on major or core or honours subjects. 

ii. English is mere a pre-exam question-answer type preparation subject 
ili. Functional English is prescribed but not appropriately practiced. 


In this context, it is worthwhile to focus on the existing problems in the following points: 


i. Usually, both at primary and secondary levels, apart from the emphasis on textbook reading 
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(only), teachers of English use the bilingual method/GTM to provide instruction. 


ii. Most often, they translate the words and sentences from Odia to English and vice versa. 


ii. It is achallenging task to deal with Englishlanguage teaching and learning in a multilingual 
and multicultural setup. 


iv. In rural and semi-urban areas and in the illiterate or semi-literate setups both tribaland non- 
tribal areas of Odisha, the learners are not mature enough to develop so much interestin 
English language learning in an environment where they speak their L1 or mother tongue 
(M.T.). 


v. Problems like lack of proper motivation or ideation at the early stage of learning; lack of 
proper mental coding and decoding; and insufficient attempt or input of achieving fluency. 


vi. The students fail to achieve the most minimum standard of fluency at primary and 
secondary school levels. 


vii. Spoken Odia versus spoken English is a long lasting issue primarily associated with the 
vernacular medium schools. 

vill. Reluctance to speak, shyness or nervousness, lack of confidence leading to several 
mistakes in English-speaking activities. 


ix. Specific problems and prospective of local linguistic and socio-cultural setups in the non- 
native areas have not yet been focused on. 


x. We have a general national language policy to adopt for which the local problems have never 
been taken care of with a long-lasting solution. 


xii. Local problems associated with English language learning are squeezed in between the 
ELTprinciples and national language policies. 


xii. The objectives of national policy on ELT at the primary level are clearly outlined in the 
NCERT Position Paper on English (2006) and the Project Document of NCERT on “Teaching 
of English at Primary Level in Government Schools” (2012). However, they cannot be 
comprehensive enough to sufficiently focus on the typical local problems which have been 
taking place since the beginning. Just like in Odisha, the western, eastern, northern, and 
southern parts have specific English language learning problems. 


The objectives of this research are as follows: 


i. To see if the specific linguistic and pedagogical problems of teachers and learners of 
English can be pointed out. 


ii. To examine if some customized syllabus in English can be used to improve the learners' 
communication skills. 


iii. Can the customized syllabus be taught adopting emerging models of second language 
pedagogy? 
1.1 Literature Review 

1.1.1 Emerging Models Second Language Pedagogy 


Based on his Bangalore project, Prabhu (1987) states the following things regarding second 
language pedagogy. 
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i. What a teacher does in the classroom is not solely or even primarily determined bythe 
teaching method he or she intends to follow. There is a complex of other forcesat play in varied 
forms and degrees. 


ii. There is also a sense of loyalty to the past —both to the pattern of teaching which the teacher 
experienced when he or she was a student and to the pattern of his or her teaching in the past. 


iii. One further factor in the teacher's ‘mental mix' is how classroom activity leads to the desired 
outcome of learning. 


iv. A teacher's 'intuition' can perhaps be the perception that he or she identifies with (or feels 
a sense of plausibility about) in an unarticulated state. 


v. Impact of innovations i.e. a new perception in pedagogy, implying a different pattern of 
classroom activity, is an intruder into teachers’ mental frames. 


vi. Language teaching specialism developed by professional debating among teacher 
community. 


vii. Eclecticism, i.e., an exercise of worldly wisdom, is a matter of operating with a 
combination of perceptions or procedures that, though all different and perhaps arguably 
inconsistent with others. (Prabhu, 1987, pp.105-110). 


From the points mentioned above, it is concluded that no methods, approaches and techniques 
are entirely suitable or enough and complete in itself for the teaching and learning of second 
language. It is rather the customised methods and approaches, tools and techniques that are of 
greater use contextually. 


1.1.2 Focus on Post-Method Pedagogy 


As Kumaravadivelu (2001) outlined, post-method pedagogy can be visualized as a three- 
dimensional system consisting of three pedagogic parameters: particularity, practicality, and 
possibility. They intertwine and interrelate in a synergic relationship where the sum total of 
the learning outcome is much more than the sum of its parts. 


i. The Parameter of Particularity 


The most important aspect of post- method pedagogy is its particularity that is any post 
method pedagogy must be insightful to a particular group of teachers teaching a particular 
group of learners pursuing a particular set of goals within a definite institutional context set 
in a particular socio-cultural environment. 


Teachers are required to develop a critical awareness of local learning and teaching 
conditions and policymakers and program administrators have to consider in putting together 
a befitting teaching agenda. Moreover, it involves practicing teachers, either individuallyor 
collectively, observing as well as experiencing their teaching acts, “evaluating their outcomes, 
identifying problems, finding solutions, and trying them out to see once again what works and 
what does not. In that sense, the particular is so deeply embedded in the practice and cannot 
be achieved or understood without it.” (Kumaravadivelu, 2001, p.539). 


ii. The Parameter of Practicality 
It refers to the fact that if context-related pedagogic knowledge has to materialize from 


teachers and their practice of regular teaching, then they ought to be able to develop a concrete set 
of rules or guidelines from their practiceand subsequently apply to their practice. He says: 
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.... This objective, however, cannot be achieved simply by asking them to put into 
practice professional theories proposed by others. It can be achieved only by helping 
them develop the knowledge and skill, attitude, and autonomy necessary to construct 
their context-sensitive theory of practice. 


Teachers’ sense-making matures over time as they learn to cope with competing pulls 
and pressures representing the content and character of professional preparation, 
personal beliefs, institutional constraints, learner expectations, assessment instruments, 
and other factors. This seemingly instinctive and idiosyncratic nature of the teacher’s 
sense-making disguises that it is formed and re-formed by the pedagogic factors 
governing the microcosm of the classroom as well as by the larger socio-political forces 
coming out from outside. In this sense,the parameter of practicality metamorphoses 

into the parameter of possibility. (Kumaravadivelu, 2001, pp. 540-41). 


iii. The Parameter of Possibility 


The parameter of possibility is related much of its ideas to the educational philosophy of the 
Brazilian intellectual Paulo Freire. He stresses the importance of approving and considering 
students' and teachers' identity substantially, and then encourages them to question the 
existing causes that keeps them subdued. He also puts emphasis on the need to develop 
theories, design the structures of knowledge, and social practices that work with people's 
experiences to the educational setting. The experiences partakers bring to the “pedagogical 
setting are shaped not just by what they experience in the classroom but also by the broader 
social, economic, and political environment they grow up in. These experiences can 
potentially alter classroom aims and activities in ways unintended and unexpected by policy 
planners, curriculum designers, or textbook producers.” (Kumravadivelu, 2001, p.543). 


In sum, the three pedagogic parameters of particularity, practicality, and possibility constitute 
the conceptual foundation for post method pedagogy. They have the potential to function as 
operating principles, guiding various aspects of L2 learning and teaching. 


iv. The Post-Method Learner 


Post-method pedagogy adopts the resources of learners of the language and considers learners 
interest bygiving them, to the appropriate level for a meaningful role in pedagogic decision 
making. Post-method pedagogy permits learners to get involved and take roles in pedagogic 
decision making by treating them as ‘active and autonomous players.’ 


Post-method pedagogy reflects on two ideas of learner autonomy. They are a narrow view 
and a broad view. The narrow view seeks to develop a capacity to learn in the learner, whereas 
the broad view goes beyond that to include a capacity to learn to liberate. “Helping learners 
learn to learn involves developing the abilityto take charge of one’s learning. To take 
charge, means to (a) have and to hold the responsibility for determining learning objectives, 
(b) for defining contents andprogressions, (c) for selecting methods and techniques to be used, 
(d) for monitoring the procedure of acquisition, and finally, (e) for evaluating what has been 
acquired.” (Kumaravadivelu, 2001, p.47). 


v. The Post-Method Teacher 
The post-method teacher is approved to be a self-directed teacher. Teacher autonomy is felt 


to be so instrumental that it can be taken as defining the spirit of post-method pedagogy. In 
fact, method-based pedagogy does not focus on any specific needs. But, post-method 
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pedagogy considers the teachers’ prior knowledge and their potential as resources bknow not 
only how to teach but also know how to act independently “within the academic and 
administrative constraints imposed by institutions, curricula, and textbooks. It also promotes 
the ability of teachers to know how to develop a reflective approach to their teaching, analyze 
and evaluate their teaching acts, initiate change in their classroom, and monitor the effects of 
such changes.” (Kumaravadivelu, 2001, pp.549). 


1.1.3 Evidence-Based Second Language Pedagogy as a Model 


It is a forward-looking collection of empirical research conducted (Ed. Mashatoshi & 
Loewen, 2019) focusing on instructed second language acquisition (ISLA) and offering a 
direct contribution to second language pedagogy by closing the gap between research and 
practice. 


i. It talks of Concept-based instruction (C-BJ]), which has been shown to be an effective means 
of enabling learners to use language effectively and creatively (Lantolf & Poehner, 2014). What 
is needed, however, is for teacher education programs to prepare teachers to implement 
instructional programs that take account of C-BI principles in their language classrooms. 


ii. It outlines two instructional approaches providing authentic, pragmatic input—corpus- 
basedmaterials (materials prepared by teachers from corpus excerpts) and corpus searches 
(teacher-supported hands-on searches of a corpus by learners) 


ii. Content and Language Integrated Learning (CLIL) and the study examines the associations 
between two language aptitude components—language analytic ability and working memory. 


iv. The effectiveness of written synchronous computer-mediated communication (SCMC) is 
asuitable strategy 

v. It talks of Foreign Language Enjoyment (FLE) and Foreign Language Classroom Anxiety 
(FLCA) which were negatively correlated but essentially independent dimensions. The 
currentstudy confirms this finding with new additional empirical evidence. 


vi. L2 Motivational Self System (L2MSS), learners’ future self-images (i.e., vision) 
substantiate their motivation and motivated behaviours. 


1.1.4 Critical Perspectives on English Language Learning in Bilingual/Multilingual Context 
i. Durairajan (2014) says that for many of our learners, English is not their language of 
thinking. However, this does not mean that they do not possess higher order cognitive skills. 
ii. They do possess these skills but in their first or more enabled languages. 

iii. To enable our learners to make connections and draw on their thinking capabilities, we need 
them to exploit their abilities in other languages in English classrooms. 


iv. The first or more enabled language is used in English classrooms but rarely as a positive 
resource by the English teacher. 


v. Oliveira (2014), in this context, says that we have to show respect for, and interest in 
diverse students’ home languages. 


vi. In second-language-learning contexts, where L.L.s are learning the dominant language, it 
isessential for teachers to show respect for and have interest in L.L.s' home languages andto 
send caring and welcoming messages about L.L.s' linguistic resources and what they bring to 
the learning environment. 
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1.2 Research Questions 


i. Can the specific linguistic and pedagogical problems of teachers and learners in the Odishan 
context be pointed out and categorized under ELT research parameters? 


ii. Can customized syllabi for different levels be implemented in Odia medium educational 
institutions adopting the emerging second language pedagogy models and obtain sound 
output? 


2. Article Structure 


The article importantly constitutes four parts. Introduction focuses basically o the empirical 
evidence and experiential data regarding the topic of research, including objectives of the 
research. In Literature Review, a critical discussion has been made focusing on the models 
of emerging second language pedagogy. Then, following the research methodology 
parameters in applied linguistics, data has been collected, presented and analyzed. Finally, a 
set of recommendations have been made to improve and update the teaching and learning of 
English in Odisha. 


3. Method: Observational and experimental, descriptive and qualitative 
3.1 Sample/Participants 


Participants include the teachers of the English language as well as students learning/studying 
English at primary, secondary and tertiary level academic institutions in Odisha, those who 
are chosen randomly. Apart from this, some classroom observations have also contributed to 
the availability of data. 


3.2 Instruments 


The instruments have been taken from three pedagogical sources. They are eclecticism, post- 
method pedagogy, and evidence-based language teaching. In addition, personal interviews with 
both teachers and students of English in Odisha help develop critical inquiry into the matter. 


3.3 Data Collection Procedure 


Some of the methods of collecting data include interviews, classroom observations, and 
literature reviews. 


3.4 Data Analysis 


3.4.1 Outcomes of a Classroom Activity 


A class was conducted, taking 20 students of class four from rural vernacular schoolswho 
were chosen randomly. 


i. They usually speak different dialects of Odia, of their spoken English knowledge is very 
low. 


ii. They were made to practice activities related to all the four skills in English 


iii. Initially, the practice was done entirely in English to English. The result was not satisfactory. 
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iv. Then, the practice was done explaining to them all the contexts in their local Odia dialects 
by therespective facilitators who speak the dialects and then using very little of Odia allowing 
code-switching and pattern practice (Scott Thornbury) in English was done with appropriate gestures. 


v. Local cultural contexts were given priority for cognition. 


vi. The students were made to think, understand and feel free. This time there was marked 
improvement noticed among 70% of the students. 


vii. This activity was then extended to some more media and culture related experiences of 
thestudents as well as concise storytelling and writing activities. The students were curious to 
take part. This observation coincides with the research report of Anderson and Lightfoot 
(2018). 


3.4.2 Results of Adopting Models of Emerging Second Language pedagogy 


i. English language teachers adopting eclectic strategies; parameters of particularity, 
practicality and possibility; adopting Evidence-based SL pedagogy like Content and Language 
Integrated Learning (CLIL) happen to feel significantly immersed in the system and feel free 
to deal with classroom activities that were not possible because of lack of freedom from 
prescribed textual guidelines of teaching and learning. 


ii. Students at all levels intend to learn English just like their respective mother tongues and 
not in an isolated manner giving importance to language learning items theoretically and 
textually. It is rather their cognitive, as well as cultural constructs that work better through 
extended interactive programs of language learning. 


4. Discussion 


4.1 Critical Focus on the Current Status 


Despite a strong ‘instrumental’ motivation and 'positive' attitude to English language learning, 
the use of the language is confined mainly to the formal learning context. At the social level, 
the English language has the status of a cow, highly venerated (as the language of privilege, 
prestige, and opportunity) but perennially in a state of neglect (never used as a medium of social 
interaction or cultural communication). 


English is the official second language in the school curriculum. It is also used as the choice 
language for higher education, employment, administration and court. English in speechis 
confined to classrooms or scholarly gatherings such as seminars, symposiums, or international 
gatherings. As reported by British Council in its The English Impact Report (2013), in India, 
92.07% of schools up to the Secondary level use the vernacular language as the medium of 
instruction. Only a few private elite schools catering to the higher-class clientele use English 
as the medium of instruction. They offer suitable English language learning contexts in 
varying degrees. In Odisha, these schools, like anywhere else, also prioritise English 
language learning and create an English language learning environment where learners get 
exposure to standard English from the Nursery class onward and scope to use English in all 
co-curricular and extracurricular activities. They are inspired by the teachers who follow the 
school philosophy of allowing conversation only in English, both with teachers and friends 
during school hours. This practice helps the learners acquire communicative skills and 
confidence tospeak, interact and inquire in English. 


On the other hand, students of vernacular medium schools neither learn CALP (Cognitive 
Academic Language Proficiency) nor BICS (Basic Interpersonal Conversational Skill). They 
get limited exposure to English only in the English classes where writing the alphabet and 
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learning the spelling of words take precedence over learningthe pronunciation, intonation, 
rhythm and vocabulary of English. So, they acquire part of the language skills, especially 
reading and writing but not even regular communication. It is pertinent to mention that though 
the English language is accepted as the second language in schools, it is still a foreign 
language in the informal learning context, i.e. the home, the society, the state and the nation. 
According to researchers, the people of Odisha have a strongpositive attitude to the learning 
of English. It is viewed as an instrument for material prosperityand social climbing. 


4.2 English Language Learning Context 


Exposure to the target language and the opportunity to use it is necessary for improving 
Communicative Competence (CC). The type and quality of English language learning 
opportunities available to the learners of Odisha are definitely not encouraging. The two main 
objectives, Teacher Proficiency (TP) and English Environment (EE) (NCERT, 2005) of a 
formal second language learning situation determine the quantity and quality of exposure as well 
as scopes to use the language. In this context, Odisha again gives a poor picture in this regard 
because substantial numbers of learners study in schools run by the government, the 
municipality or local private organizations, where both TP and EE are unsatisfactory. 


Research observations on the performances of the CC of the learners of Odisha are alike in 
essence. It consistently specifies that the two passive language skills i.e. listening and reading 
are more improved than the two active skills, i.e. speaking and writing. According to 
Mahfooz’s report, “...the learners of higher secondary classes in Orissa, in general, have 
average to good English language competence in readingand listening skills. In comparison, 
they are poor to average in speaking and writings skills.” (Mahfooz, 2013, p. 260) However, 
Mishra's (2013) report on the CC of the tribal learners of Western Odisha reveals different 
scene. In this context, he says: 


A more conclusive finding of the study is thatthe English language learning context of 
the tribal students is far from favourable. Their socio-economic, linguistic background 
does not provide enough opportunities to acquire or use the English language. Though 
theoretically, their English language learning starts at class IV, still by the time they 
complete class X, they have not learnt the basic skills, as is evident fromtheir 
performance in the CCT (Communicative Competency Test) (Mishra, 2013, p.77). 


It was observed that the English language learning context of Odisha, both institutional and 
social are relatively different according to geographical setups, the mode and medium of 
instruction at the primary school level, the institutional norms adopted in relation to 
English language teaching, the proficiency of teachers as facilitators, the socio-cultural and 
linguistic background of the learners, the degree of exposure to the language and the scope 
to use the language both in speaking and writing. The better the learning context, the better 
is the competence. 


5. Conclusions 
Taking references from the emerging second language pedagogy of Prabhu (1982), 


Kumaravadivelu (2008) and Sato and Loewen (2019), the following points can be considered 
Vital. 
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i. Need proper motivation (L2MSS) to create an encouraging environment concerning social 
and professional requirements. Teachers have to be resourceful, exciting and intuitive in 
creating good conceptual clarity and dealing with English language teaching. 

ii. Use local socio-cultural, linguistic and pedagogical tools to design the classroom activities 
so that learners will enjoy and not suffer. This prcess can be achieved through periodical 
training and debating among the language teachers. 

ii. The prescribed textbooks in Odia language and literature should also be made available in 
their respective English translations to simultaneously read and do the language activities. 

iv. Code-switching and code-mixing to be encouraged with proper corrective feedback(C.F.). 
v. Learner autonomy should be encouraged using electronic gadgets like electronic media and 
mobile phones. 

vi. The linguistic features of Odia and its respective regional dialects should be studiedbased on 
similarity and comparison to make the learners aware and avoid interference. 


vii. Keeping in view the local, national and international requirements, a functional syllabus 
can be prepared following a second language management design that aims to manage specific 
items of vocabulary, grammar and language skills with aconcrete target. 
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WordNet, Expansion, . 
Ontology, Synsets, Mental WordNets have emerged as a crucial resource developed for Natural 


Lexicon, Natural Language Language applications. The Bhojpuri WordNet is a WordNet styled 
lexico-semantic database for Bhojpuri language. It has been developed 
following the style, methods and strategies almost the same as that of the 
Hindi WordNet and the Princeton’s English WordNet, so it has been 
created by following the expansion approach from Hindi WordNet. In this 
paper we have made an attempt to develop Bhojpuri synsets and the 
procedures which we followed to create the Bhojpuri synsets which 
consists of concept, gloss, examples and synonyms. It also discusses the 
problems faced during the process of building synsets and includes basic 
semantic relations in it. Such kinds of efforts create an extremely rich and 
vast lexical resource which will be an indispensable resource to develop 
Bhojpuri WordNet. This kind of lexical resource is useful to solve natural 
language processing (NLP) problems like Word Sense Disambiguation 
(WSD), Machine Translation (MT) and Information Retrieval (IR) 
system. 


1. Introduction 

WordNet is a lexical database of synsets and semantic relations. It unites the words into sets 
of synonyms called synsets, provides short, general definitions, and records various semantic 
relations between these synonym sets. It is more than a dictionary and it is fashioned as a lexicon 
based on psycholinguistic principles. Each synset expresses a distinct concept. Synsets are the 
basic building blocks of WordNet (Bhattacharya, 2006). Synsets are sets of synonyms. They 
are linked by semantic relations like hypernymy, meronymy, troponymy etc. WordNet is 
considered to be the most important resource available to researchers in computational 
linguistics, text analysis, and many related areas (Bhattacharya et al., 2010). Till now WordNet 
along with English is available for 18 Indian languages and for 200 + major languages of the 
world. But there is limited electronic resource for Bhojpuri. Bhojpuri WordNet is to be 
developed using the expansion approach from HindiWordNet. We need large knowledge banks 
like WordNet, EuroWordNet, IndoWordNet and lexicons of high quality and coverage. By 
considering this lexical resource we are interested in developing Bhojpuri WordNet. We would 
be following the expansion approach from Hindi to develop Bhojpuri Wordnet. And because 
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of its linked approach, it can be beneficial for multilingual translation and dictionaries across 
Indian languages. Secondary interface with WX script will not only increase the readability but 
would also be interesting for non-natives. 

The 21“ century is high era and of the Information technology. This is facilitated with 
development and convergence of computing, telecommunication and multilingual Information. 
People have been using computers and the internet in their own languages. So far, we Indians 
are forced to use them in English or in a few Indian languages. Because of less availability of 
information in our languages, our socio-economic condition is least improved. As we know that 
people are more creative and cognitive in their own tongue, so there is a need to support our 
languages on the technological front. Because of less availability of information in our 
languages, our socio-economic condition is least improved. There is a requirement to have 
large scale online resources for natural language processing across languages; in general, for 
Bihari languages too. 

The main objective of this work is to discuss and find out an efficient method to build a 
wordnet for the Bhojpuri language. Since Bhojpuri is still far away from being processed 
computationally. The digitalization of the language will encourage future research in it as there 
are many precious scriptures available in this language that hasn’t been processed yet. Simply 
the goal of the study is to contribute: Machine translation systems (MTs), multilingual 
dictionaries among Indian language, Language specific Synsets, Word Sense Disambiguation, 
Automatic text analysis, AI applications and Corpus study. 


1.1. Literature review 

English WordNet was the first wordnet in the world which was built at Princeton University. 
Later wordnets for European Languages like EuroWordNet (Vossen, 1998) have been 
developed. Polish WordNet was also being mapped to Princeton WordNet based on the strategy 
followed by IndoWordNet. The first wordnet for Indian Language was HindiWordNet emerged 
in 2006. Since then, wordnets for a number of Indian languages have been developed, in parallel 
with HindiWordNet (Narayan, 2001) at IIT Bombay. Project as IndoWordNet was initiated by 
DeiTY that was a practice to bring all Indian languages under one umbrella and accommodate 
all conceptual diversities within a single framework. It was developed in three subcomponents; 
The 
North East WordNet was developed for Hindi, Marathi, Sanskrit, Assami, Bodo, Manipuri and 
Nepali, The Dravidian WordNet worked for Telugu, Tamil, Kannada and Malayalam and The 
Indradhanush project consists of Gujarati, Kashmiri, Konkani, Bengali, Oriya, Punjabi, and 
Urdu. IndoWordNet has been developed as a linked-mode multilingual project for 18 major 
languages of India and linked with English WordNet. Nowadays, wordnets are available for 
200 languages of the world. Seeing the enormous potential of wordnet, 18 languages of India 
made their wordnets under the flagship of IIT Bombay. These wordnets have been created by 
following the expansion approach from Hindi WordNet. 

EuroWordNet: (Vossen, 1998): It was a multilingual database consisting 8 Europian 

languages was created in 1996 — 1999. Each concepts were linked to the closest synset in the 
Prenciton WordNet so it allowed cross-language information retrieval from one language to 
another. 
IndoWordNet: It is similar to EuroWordNet. However, the pivot language is Hindi which was 
also linked to the English WordNet. It began in 2000 adopting the expansion approach. It covers 
typical Indian language complex phenomena like complex predicates and causative verbs. Due 
to the morphological richness and different cultural traits of Indian languages linkage approach 
was also adopted. 
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PLWordNet: plWordNet differs from most of the other WordNets: lexical units rather than 
synsets are the basic building blocks. The addition of new material to p|WordNet is consistently 
informed by semantic relations and by various analyses of large corpora. The mapping is 
difficult because of the subtly distinct structures and because of WordNet’s focus on synsets. 
Synsets in plWordNet are merely groups of similarly interconnected lexical units. 
SYNSKARTA is an online interface for synsets creation following the expansion approach 
specific to Sanskrit. This is based on relation database management system and developed via 
PHP. No data redundancy and inconsistency so easy to maintain. It also consists of etymology, 
references and expectancy for the words along with most of the features of IndoWordNet and 
WordNet. 

FinnWordNet ( 2 lacks words only in 100 days). BabelNet ( Navigli, 271 languages, 2010). 

Some other projects that have also been developed as knowledgebase of semantic 
database; like 

MindNet () MindNet was designed to be continuously extended. It was first built out 
of the LDOCE and later included American Heritage and the full text of Microsoft Encarta. The 
underlying technology is based on the same parser used in the Microsoft Word grammar 
checker and was deployed in the natural language. 

ImageNet (2006) It is an image database organized according to the WordNet hierarchy 
(currently only the nouns). It contains possibly 1000 images for each synsets of the WordNet. 

VerbNet (2006) It was motivated by Levin B’s English Verb classes and alternations- 
1993 and a lexicon that groups verbs based on their semantic/syntactic linking behavior. 

PropBank (2002) A Verb oriented resource of one million words of English text, 
annotated with argument role labels for verbs; and a lexicon defining those argument roles on 
a per-verb basis. 

HowNet (Zhendong Dong and Qiang Dong, 1999) is an on-line common-sense 
knowledge base unveiling inter-conceptual relations and inter-attribute relations of concepts as 
connoting in lexicons of the Chinese and their English equivalents. 

ConceptNet (Hsu and Chen, 1999) is developed by MIT Media Laboratory and the 
largest commonsense knowledgebase through OMCS corpus. It covers commonsense 
knowledge to describe the real world; there are 20 kinds of relations categorized as causal, 
spatial, functional, etc. As commonsense knowledge is deeply context-sensitive, the suitability 
of ConceptNet for query expansion is still not clear. 

FrameNet (1998) The Project is built as a Semantic Frame of English that is both 
human and MR, based on annotating examples of how words are used in actual texts. FrameNet 
is mainly composed of three parts - Definition, FEs and Frame-frame Relations. 


1.2. Research question 

In this paper, the problem of constructing a large scale and high quality WordNet for a low- 
resource language Bhojpuri is studied. Bhojpuri WordNet can be used in Natural Language 
Processing, Information Retrieval and other related areas. Hindi WordNet can be used to create 
Bhojpuri WordNet by following the Expansion approach. Expansion approach is being used as 
both Hindi and Bhojpuri belong to the same language family and the Bhojpuri WordNet making 
process would become well guided and can save an enormous amount of time using less 
resources. So the preferred approach is to utilize WordNets in other languages by adopting their 
structure and translating their content. 


2. Article structure 
This paper is divided in into 7 sections. Sections are also divided into many sub sections. In 
the previous section we have already discussed the literature reviews. Section 3 presents the 
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method to develop Bhojpuri WordNet and sub section 3.1 discusses data sample, 3.2 deals with 
the instrument used for Bhojpuri WordNet. Sub section 3.3 tells data collection and analysis. 
Section 4 talks of the development of the Bhojpuri WordNet. Section 5 deals with the findings 
of the study and winds up discussion and section 5.1 presents Bhojpuri statistics. And finally, 
section 6 presents the conclusion and further scope of the study. 


3. Method 

WordNets can be built using the merge approach or the expansion approach (Vossen 1998). 
The merge approach is time-consuming and exhaustive. The expansion approach is very helpful 
and wasy for closely related or related-family languages. As we know that the morphological 
richness of language A than the language B, language A for its synsets might have less or more 
synonyms and less or more concepts for a word. Since Bhojpuri belongs to the Indo-Aryan 
language family and considered as a dialect of Hindi language. It would be better to follow The 
Extension Approach in the development of BhojpuriWordNet. Being from the same family 
member, almost most of the concepts of Hindi would be available in Bhojpuri so it can be easily 
linked to with Bhojpuri synsets. We would be following the hybrid method of the expansion 
approach along with linkage approach. Linking synsets will facilitate cross lingual translation 
and there is no blockage for Information sharing among them. Interface for BhojpuriWordNet 
would be in Matrix fashion and using the MCR principle of Wordnets. 


Word of the SL <> SYNSET <> Word of the TL 

3.1. Data sample 

For the creation of the Bhojpuri WordNet, Hindi WordNet’s synsets were taken as a source 
and using the expansion approach we incorporated the equivalent synsets for Bhojpuri 
language. As expansion approach allows deletion or addition of the synsets based on the 
existing concept in the language. The synsets of Hindi WordNet are modified to the synsets of 
Bhojpuri WordNet through addition or deletion of synonyms in the synset. Some native 
speakers were also participated for more assurance of different usages of a particular word. 


The following steps were taken during the development of the Bhojpuri WordNet 

- Synset were accessed from the source Hindi WordNet and studied carefully and 
corresponding synset was created for Bhojpuri. 

- Synonyms words for that specific concept were taken from the lexical stock of Bhojpuri. 

- The concept definition, gloss and examples were translated into Bhojpuri for identical 
glosses. 

- Then the ready synsets were validated through different existing lexical database. Of 
Bhojpuri. Concept mismatch, spelling mistakes, and example relevance were also 
checked with respect to Bhojpuri language. 


3.2. Instrument 
Synsets are kept by following the principle rule of MCR. To find different usages for a 
concept we tested it on Bhojpuri corpus of different domain too. Hindi corpus sample 
downloaded from TDIL corpora were also translated in the target language and checked for 
MCR principle of WordNet. Since Bhojpuri is morphologically less rich language and a little 
electronic resource are available for it. So research work can be affected due to many reasons. 
Some problems are as follow. 
Morphologically less rich language and no corpus is available. 
Few literary texts and mostly in spoken form is available. 
Tagged and Segmented corpus is not available. 
Font and encoding problem is to the some extent. 
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3.3. Data collection and analysis 


Since Bhojpuri Wordnet is to be built in expansion approach, we have to; to retain the PoS 
in the expansion approach and/or to create different PoS for Bhojpuri and allow cross PoS 
linkage in BWN. During synset creation for Bhojpuri we would follow simply these steps: - 

O Take synsets of Hindi Wordnet and prepare Bhojpuri synset for that lexical concept. 

O Collect syno from lexical resources, Bilingual Dictionaries and Spoken form of 


language and No separate ID creation. 


O Arrange them according to their frequency and but No change in Ontology nodes and 


PoS. 
Source language: Hindi 


Synset ID : 196 POS : Noun 





syno : Ofdd, AAS, ARG, sedi, Ros, UA 





Gloss : HAM oc al Bae AS Hg Ue 





Example : 6 Alda HWS Hers Viel 6 
Gloss in English : a human being; "there was too 


much for one person to do. 


Synset ID : 858 POS : Noun 





Syno : sel, YSN, 4a, RU, Ga 


Gloss : RUlld GI AIy 


Example : seat afk sik GT ani aaa 
fra etc el 





Gloss in English : an adult person who is male (as 
opposed to a woman) 


Synset Id : 6258 POS : Verb 
Synonyms : alsa, MMA, el 








Gloss : Fd el Vieal-sieal TR GOTH AA | 





Example : faccil de HY acl GI SAT BIR Ist 


Gloss in English : move fast by using one's feet. 





Target language: Bhojpuri 


Synset ID : 196 POS : Noun 
Syno: adaml, maraxa , mAnusa 


Gloss : adamlI jatl ke samuh me ka 
kavano ego 


Example : hara marada ka pahcAna alage 
alage holA 


Gloss in English : a human being; "there was 
too much for one person to do. 

Synset ID: .... POS : Noun 
Syno : maraxa, sawanga, nara, lugavA 


Gloss : nara jAti ka mAnusa 


Example : axamI aUra Oraxa ke xeh ka 
banA vata alaga alaga hoKelA 


Golss in English : an adult person who is 
male (as opposed to a woman) 
Synset Id: .... POS : Verb 


Synonyms : XaudZanA, BhaganA, 
mAnjanA 


Gloss : bahuwe jalaxI jalaxI godZa 
uTAkar calanA 


Example :bilalyA cuhavA ke deKawe 
okarA orl BAgala. 


Gloss in English : move fast by using one's feet. 


We would also be following the hybrid method of the expansion approach along with linkage 
approach. Linking synsets will facilitate cross lingual translation and there is no blockage for 
Information sharing among them. Interface for Bhojpuri WordNet would be in Matrix fashion 
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and using the MCR principle of WordNets. We would have a secondary Interface transliterated 
in WX notation which can increase the readability and efficiency even for non-natives. 


4. Developing Bhojpuri WordNet 

4.1. Problems in the Classification of Synsets 

A WordNet has approximately thousands of synsets. The availability of the concepts in a 
language may differ in comparison of the synsets of English WordNet. We have to be careful 
whether the synsets can be linked or not. Synsets have to be categorized as their existence. 
IndoWordNet and global WordNet Association team have suggested synsets categories. 
Universal: Concepts which are common across all languages of the world may be be 
categorized as universal synsets. For example: The synset of "god". Pan-Indian: Concepts 
which are limited to all languages of India. They are categorized as Pan-Indian synsets. For 
example: "papad". In-Family: Concepts which are limited to a certain family of languages 
(Indo-Aryan, Dravidian and Sino-Tibetan) can be considered as In-Family. For example: 
"bhatija". Language Specific: There is no issue for these types of concepts in synset 
categorization. Only synsets expressing concepts that are common in a specific language have 
to be carefully watched. Transliteration, Type of or Direct Hypernymy , and Artificial phrases 
can be used for them. For example: "KicadzI", the most important festival of Bihar region. 
Rare: These are synsets expressing rare concepts. For example: Technical terms. These 
concepts are necessarily be transliterated. 


4.2. Creation of Language-Specific Synsets 

Language is the unique and common feature of human beings. People have been using it 
for his communication so they need to refer to the concepts. Many concepts are universal but 
some are only limited to some region or religion. Due to the different regions, 
religion, civilization and cultures and long history, some languages have concepts which may 
not be available in other languages. These types of concepts can be organised as language- 
specific synstes because they are only available in a particular language. Bhojpuri has some 
concepts which are language specific, for example 

We followed some steps to identify concepts specific to Bhojpuri. We firstly collected 

Bhopuri specific concepts from multiple sources then verified them on their nature and then 
categorised the identified concepts. We have made an attempt to classify Bhojpuri language- 
specific synsets into four categories as given below: 
a. Cultural untransbility: Purwanchl and Maghadh regions have their own social cultural 
practices, rituals and festivals. The ‘sAma CakevA ’, ‘CaTa’ and ‘KIcadZI’ are only practiced in 
these areas. Marriage trends like ‘gavanA’, ‘caUxT and ‘xoMgA’ are special in the Bihari 
region. These concepts cannot be translated, because these concepts are Bhojpuri language. 


b. Game and Sports: Many regions have some localized sports trends which are limited 
only to the specific areas like ‘holavA pawAla’, ‘gulli dZandA’, ‘piTo’ and ‘got’. 
c. Songs, Music, and Dance:This category refers to folk songs and music such as Xobiya 


nAca, blrahA, niraguna and gita gaval, ‘nirbAnI’ ‘sohara’ etc which are special varieties of folk 
songs and music. This classification also covers musical tools like JaMJa, WariyA etc which 
are only in Bhojpuri. 

d. Kitchen utensils and Regional tools: Some utensils and tools are specially designed 
which are used in different ways. ‘Kurapl’ ‘kajaraUtA’ and ‘gadAsA, ‘hasull’ and ‘kaTUIA’ are 
Bhojpuri language-specific synsets. 


5. Result and discussion 
Since Bhojpuri is considered as a dialect of Hindi language, they are more very similar each 
other. During the development of Bhojpuri WordNet, the expansion approach created many 
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challenges. Due to the regional variation and less rich morphology language, Tadhbhaviness 
features of Bhojpuri, many challenges we faced at the time of creating synsets. Some of them 
are: 

(1) Conceptual inadequacies. 

(11) Hindi words with different meaning/interpretation in Bhojpuri. 

(iii) | Sometimes, no indigenous lexeme in Bhojpuri. 
For example: Synset ID: 6529 
Concept: MExAn ‘field’ Yard consisting of an outdoor area for children; play but for Bhojpuri 
‘mExAna used for going to the toilet outside. 


5.1. Bhojpuri WordNet Statistics 

The development of a lexico-semantic database for Bhojpuri has been carried on for two 
months, and nearly 300 synsets have been created based on the expansion approach and also 
mapped for their linkage possibilities across languages. It reduced the number of words. We 
could find only 219 appropriate synsets and remaining belonged to language specific. For its 
reliability we used a sample of tagged corpus of Bhojpuri. POS statistics of the study are as: 
120 Nouns, 37 Adjectives, 50 verbs and 13 Adverbs. 


6. Conclusion: 

This paper presents a rigorous exercise in design and development of Bhojpuri WordNet. 
Bhojpuri WordNet is an electronic and WordNet styled lexico-semantic database based on 
mental lexicon. The expansion methods for its development made it easy and fast. Semantic 
relations were accessed from the source language Hindi, as Bhojpuri belongs to the same 
language family. Since Bhojpuri has less rich morphology than Hindi so there was a need to 
sort them out by the linkage approach. The WordNet developed through these approaches and 
fashioned in conceptualization, frequency rather than alphabetical order of dictionaries can be 
a very important tool for natural language processing (NLP) tasks like word sense 
disambiguation (WSD), Information retrieval (IR), language specific synsets and machine 
translation (MT) systems across languages. Further scope of the tool is that it can be easily 
linked with other Indian languages so it will allow cross lingual translation and multilingual 
dictionaries and so on. 
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Appendix A ( Psycholinguistic mental lexicon of human language ) 





; Breathes 


Can move 
















Has wings aS als 









Breathes 


Is Yellow °/ 


Can Swim 


Has fins 


Is PINK 


Is edible 







Can Swim Eats meat 








B= IndoWordNet Statistics (As of January 2021) 


Noun Verb Adjective Adverb Total 
Hindi 29807 36837 6336 541 40371 


Assamese 








Bengali 
Bodo 
Gujarati 











Kannada 





Kashmiri 
Konkani 











Manipuri 
Marathi 























Telugu 12078 2795 S776 442 21091 
Urdu 22990 2s01 57836 443 34280 
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Keywords: Abstract 

Ontology, Verb Semantics, In simplest terms, verb frame is the systematic representation of syntactic 
Bengali Verb Frame, Verb — preferences of a verb. Verbs frames are used as lexical resources for various 
Sense Disambiguation NLP applications such as parser, machine translation, language generation 


etc. We start from the assumption that verb’s syntactic behavior is 
determined by its meaning (Levin, 1990). We are developing a Bengali Verb 
Frame Network which can potentially be used for many NLP purposes, 
especially verb sense disambiguation. Our verb frame captures the relevant 
structural context in which verb occurs in a sentence. For an ambiguous verb, 
we hope to get different verb frames for each verb sense and thereby map 
meaning difference to structural difference. Each verb sense is treated as a 
separate entry in our verb frame. Our verb frame represents the relevant 
features of verbal arguments. Ontology, in information science, is usually 
referred to any hierarchical organization of concepts which is used to 
classify, analyse or annotate any data. Our verb frame has four columns 
Dependency Roles, Argument Types, Vibhakti/Post-positions and Ontology. 
We have developed an upper ontology scheme for our verb frame. Though 
we have not directly adapted any existing ontology model, we have consulted 
Suggested Upper Merged Ontology (SUMO) for the background idea as it is 
already widely used in NLP research and linked to many databases like 
EuroWordNet. Ontology shows what the verbal argument we are trying to 
annotate, means in real world thereby incorporate real world knowledge with 
structural information. This kind of real world knowledge is very important 
for meaning disambiguation. This paper describes the role of ontology in our 
verb frame and how it helps in verb sense disambiguation. From machine 
learning perspective, we argue here that ontology is the most suitable way of 
providing real world information. 


1 Introduction: - 

The term ontology has different connotation in different domain of knowledge. The term has 
originated in philosophy where it means the study of being in general. From knowledge 
representation context, Sowa defines ontology as “a catalog of the types of things that are assumed 
to exist in a domain of interest D from the perspective of a person who uses a language L for the 
purpose of talking about D” (Sowa, 2010). Similarly, in linguistics, ontology is usually thought of 
an interconnected network of concepts. Hence, it is quite clear that ontology is regarded as a way 
of representing human conceptualization in all these domains. In artificial intelligence, Gruber 
defined ontology as an “explicit specification of a conceptualization” (Gruber, 1993, p. 908) where 
conceptualisation “is an abstract, simplified view of the world that we wish to represent for some 
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purpose”. In other words, from machine learning point of view, ontology is usually a formal 
structured schema which organises real world objects and abstractions in a hierarchical order. 


Here, we will consider ontology as a technology resource, in contrast to that of a philosophical 
concept. In this regard, ontologies are usually of two types. First one is called domain-specific 
ontology. It represents hierarchy of concepts in a particular area of knowledge like physics or 
biology. Second is known as upper ontology or universal ontology or top ontology. These are 
general purpose ontologies seeking to represent a vast area of knowledge. We are developing 
ontological framework in order to use it in Bengali verb frame which, in turn, will be used for Verb 
Sense Disambiguation. Ontology is now becoming an important aspect of knowledge-engineering 
and information science research. Most WordNets either directly use some kind of ontology 
framework or is mapped to any established upper onlotology. Most noted ontological models are 
developed in some sort of machine readable format. For exploitation of language data for 
computational processing purpose, ontology is absolutely important. Ontological tagging shows 
what a linguistic item refers to in the real world. Whenever we use language, we always process 
ontological information without really noticing it. For, it is the part of our innate world knowledge. 
But computer do not possess any such information. The point of ontology, especially in NLP, is to 
provide some kind of world knowledge so that the machine is able to connect the real world entity 
with the linguistic item. We are developing Bengali Verb Frame Network and the ontological 
format which we are discussing here is a part of this Bengali Verb Frame Network. In this paper, 
I will describe our verb frame model with special focus on the ontology section. 


When a verb has multiple possible senses, the system has to disambiguate between all of them and 
identify its correct meaning as per the context. We will start from the basic assumption that a verb’s 
syntax is determined by its meaning (Levin, 1993). Hence, we expect to get a different verb frame for 
each of the senses of a particular verb. The proposed Bengali verb frame lexicon is planned to capture 
fine-grained, context -sensitive information associated with each sense of a verb. Purpose of this 
Bengali Verb Frame Network is to help NLP tools in sense disambiguation. 

In simplest terms, verb Frame is a systematic representation of syntactic preferences of a verb. Verb 
frame portrays the syntactico-semantic behavior of a verb. Our verb frame is divided in four sections 
which are a) Dependency Roles b) Argument Type c) Vibhakti/Post-position d) Ontology. This paper 
will present development, analysis and critical estimation of our ontology model within the complete 
verb frame model. Verbs frames are used as lexical resources for various NLP applications such as 
parser, machine translation, language generation etc. This work will be an important database for NLP 
tools like parsers, question answering system etc. 


2 Framework: AnnCorra and Paninian Computational Grammar:- 

This section seeks to elaborate the theoretical framework which is Paninian dependency grammar. 
Three sections of our verb frame namely Dependency Roles, Argument Types and Vibhakti/Post- 
positions is directly related to Paninian dependency grammar. Forth and last section is Ontology 
which is the focus of this paper is not derived from Paninian Dependency Grammar. We have built 
a separate framework for Ontology which will be discussed in the later sections. Paninian 
Grammar (PG) is a dependency grammar that treats sentence as a series of modifier-modified 
relations. It was developed by Panini (4" century BC) for Sanskrit language in ancient India. Panini 
produced a linguistic theory for analysis of sentences (Kiparsky and Staal, 1969). In Paninian 
dependency, verb is in the center of all dependency relations. Most important part of Paninian 
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grammar is its karaka relations. Karaka is the structural-semantic relation that connects a verb with 
the nominal elements (nouns and pronouns) in a sentence. Paninian Grammar specifies 6 karaka 
roles i.e, 


1. Karta 

2. Karma 

3. Karana 

4. Sampradana 
ae Apadana 

6. Adhikarana 


These karaka relations described by Panini, to some extent, correspond to the thematic roles. We can 
map them to thematic roles in this way 

(1) karta (k1) ‘agent/theme/experience/actor’ 

(2) karma (k2) ‘theme/patient’ 

(3) karana (k3) ‘instrument’ 

(4) sampradana (k4) ‘recipient’ 

(5) apaadana (k5) ‘source’ 

(6) adhikarana (k7p) ‘location’ 


Karaka roles are not completely semantic in nature. There can be large number of high level semantic 
reltions. But the karaka roles are fixed number of relations which capture only a certain level of 
semantics. Though we can roughly map the last four karakas to their thematic counterpart, the ‘karta’ 
and ‘karma’ are entirely different from agent and theme. These two karakas are roughly equivalent to 
syntactic criteria like sentential subject and sentential direct object respectively (Bharati, Chaitanya 
& Sangal, 1995). This is because thematic roles are purely semantic whereas karaka relations are 
syntactico-semantic in nature. 

Based on Paninian Computational grammar, scholars from IIIT-Hyderabad and others has 
developed a multi level dependency based tagging scheme named AnnCorra (Bharati, Sharma, 
Bai, & Sangal, 2006). AnnCorra consists of three different tag sets 1. PoS tags 2. Chunk tags 
3.Dependency tags. AnnCorra dependency tags have six karaka roles as well as many non-karaka 
roles. Our dependency tags are based on AnnCorra. Very limited modifications are done where it 

is absolutely necessary. We have followed AnnCorra tagging scheme for Dependency Roles 
annotation and Argument Type annotation in the development of our verb frame. 


3 Methodology: - 
Objective of our research, as already stated, is to create lexical knowledgebase in order to 
disambiguate among the many senses of an ambiguous verb. Our basic assumption is meaning 
determines syntactic behavior of a verb. Hence, difference in meaning suppose to cause difference 
in syntactic behavior. Our verb frame is designed to capture the totality of contextual information 
in which the given verb is likely to occur. Since, context is dependent on verb’s meaning, for 
ambiguous verbs, we expect to get different verb frame for each of its meaning. In other words, 
difference in verb frame in different context reflects different meaning of the same verb. 
3.1 The primary resources 
The primary resources that will be used for this study are:- 

1. Bengali monolingual Dictionary 
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2. A Bengali WordNet 
3. Corpus 
4. Verb Frame Model 


Dictionary is needed to get the verbs. Since, some dictionaries enlist different senses for each lexical 
items, it can also be used in order to identify ambiguous verbs. I have selected one Bengali 
monolingual dictionary ‘Adhunik Bangla Obhidhan: Bangla Academy’ and collected more than 100 
verbs which has more than one senses according to the dictionary. WordNet is required for sense 
information. I am using Indo-WordNet which includes Bengali WordNet as well. One simply has to 
search on Indo WordNet website with the verb already collected from dictionary to see the different 
meanings with example. Corpus is the most basic resource for this research. Corpus will be consulted 
to get the syntactic distribution. We have made use of TDIL corpus which comprises of over hundred 
thousand Bengali sentences. ‘Itrans’ scheme is used for transliteration. 


3.2 Architecture of the Verb Frame:- 
Each Verb frame includes a number of basic and necessary information related to that particular sense. 
In addition to the syntactico-semantic information provided in the verb frame other details like the 
following are also provided. Our verb frame consists of two sections 1.General Information 2.Verb 
Frame. General Information section includes some basic details about the verb sense, such as: 

1. Sense ID 
English Gloss 
Verb Class 
Verbs in the Same Class 
Example Sentence 


cman a 


The second section is Verb Frame. It includes: 
1. Dependency relations 
2. Argument Type 
3. Vibhakti/post-position 
4. Ontology 


The verb frame is developed in tabular format. It has four columns: Each one provides a different type 
of information. I have includes two complete examples of our verb frame at the end of this section. 
First column is Dependency relations, the second column: Argument type, third column: 
Vibhakti/Post-position, fourth column: Ontology. We have mostly adopted AnnCorra (Bhrati et al 
2006) dependency tags with little modification. Bengali has certain constructions which cannot be 
properly annotated by using existing AnnCorra tags. We have either modified existing tags or added 
new tags only in these cases. AnnCorra dependency tags are given below in a diagram. 
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mod 


ee 
vmod 


nmod  jjmod rbmod 


i IW : 
NS etc 


rt ‘ras adv ty 













etc 





ras-k] 
ras—k2 
ras—k3 etc 


AnnCorra Dependency Tags 


Argument type is the second column of our verb frame model. Here, we have used AnnCorra 
chunk roles to tag argument types. We have just added one tag label i.e. ‘Cl’ for clause as AnnCorra 
do not have any tag for clause. Each chunk is named according to the corresponding phrasal 
category. For example, noun chunk will be tagged as NP. The tags which we have used to label 
Argument Type are NP (noun chunk), VGINF (infinitival verb chunk), VGNN (gerunds), JJP 
(adjectival chunk), RBP (adverb chunk) and CL (clause) 


Vibhaktis or case endings are the suffixes which marks karaka. Post-positions perform many 
other functions apart from signaling dependency relations. Since numbers of vibhaktis are limited 
in Bengali, post-positions mark dependency roles where vibhakti is not available. Many karaka 
relations and all non-karaka relations are marked by post-positions. 


Ontology is the forth column of our verb frame. We have developed a novel ontology schema for 
our verb frame. It will be described in detail in the very next section 
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Verb frame for the verb sense deoyaa_1 


SENSE_ID deoya_1 
SENSE ‘to pay’ 
VERB_CLASS 4 (k14+k2+kK4) 


VERBS_IN_THE_SAME_CLASS balaa_1, deoyaa_3,deoyaa_4,deoyaa_5,deoyaa_6 
EXAMPLE_SENTENCE o ei kajer jonno amay tiris haajaar Taakaa dichChe 
he this work+gen for me thiry thousand Rs. give+imperfect 


He is paying me thirty thousand rupee for this work. 


VERB_FRAME 


DEPENDENCY_RELATIONS k1 rt k4 k2 

ARGUMENT_TYPE np np np np 

VIBHAKTI/POST-POSITION ie) jonno ie) oO 

ONTOLOGY +human +human +cur 
+org +org 


Verb frame for the verb sense deoyaa_2 


SENSE_ID deoya_2 
SENSE ‘to put’ 

INDO _WORDNET_REFERENCE 
VERB_CLASS 9 (k14+k2+k7p) 


VERBS_IN_THE_SAME_CLASS melaa_1 





EXAMPLE_SENTENCE egulo nice dao 
these below put 
Put these things below 


VERB_FRAME 


DEPENDECNY_ROLES k1 k2 k7p 
ARGUMENT_TYPE NP NP NP 
VIBHAKTI/POST-POSITION Oo Oo e/te 
ONTOLOGY +human +object +region 
+object 


4 Developing Ontology for the Bengali Verb Frame: 

Our ontology is, for the most part, based on Suggested Upper Merged Ontology (SUMO). SUMO 
is an upper ontology, perhaps the largest upper ontology existing in the public domain. According 
to its developers, SUMO (Pease, Niles & Li, 2002) is created by merging a large number of other 
existing upper ontologies. Apart from the upper ontology itself, SUMO has a number of separate 
domain specific ontologies as well. SUMO along with its domain specific ontologies comprises of 
approximately 25,000 terms 80,000 axioms, 4730 rules and 1280 relations. SUMO is written in 
SUO-KIF language. Since it is completely written in formal language, it proves to be very useful 
for the machine learning purposes. SUMO is linked to the various WordNets in many different 
languages. 

We have used the SUMO terms and followed SUMO definition of those terms as much as possible 
in our modeling. But SUMO is neither created specifically for language analysis not it is practical 
to use thousands of terms in our ontology schema. We had to simplify it to suit our purpose. The 
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motivation behind SUMO is to represent all known concepts and objects in a formal, ordered 
hierarchical order, whereas we need ontology in order to make ontological generalisation of the 
verbal arguments. This will, in turn, facilitate us in Verb Sense Disambiguation. Our ontology has 
43 terms altogether. We have here taken a bottom up approach in creating this resource. We started 
from the verb frame and find out what are the ontological concepts we need in order to reach the 
ontological generalisation regarding verbal arguments. Then we have developed an ontology 
model in which our necessary concepts are mapped on. SUMO has the concept ‘human’ as well 
as the concept ‘hominid’. We don’t need a concept like ‘hominid’, but we need a concept for 
‘human’ entities. The reason behind this choice is that a large number of verbs like verbs of 
cognition, verbs of social interaction etc take ‘human’ entities as their agent. There is no such use 
for the term ‘hominid’. Hence, we have included ‘human’ and discarded most of the intermediate 
nodes. We are only annotating the Bengali nominal constructions which can act as an argument 
for the senses of an ambiguous Bengali verb. Hence, we don’t need all concepts. Through we 
made our ontology very simplistic, it is still an upper ontology and is internally consistent 1.e. it 
still represent a wide range of object and concepts in a consistently way. It works well in consort 
with the other elements of our verb frame. Our ontology schema is presented below. 


Ontology Framework 


aOrganisation 
7 Human 
Animate 
Plant 
Living” ™ ‘4 Animal 
4 a 
BodyPart 
7 Natural 
a Non-living \ others 
SelfConnectedObject 
a ‘ 
” Solid 
y p> Liquid ‘ 
obed— “A Artifact > Vehide 
4 " “4 Gas 
Rei 
sais Vv Garment 
» Sound 
Physical pContentBearing Physical 
a is ¥ Tools 
" y Transition yimage 
P Oth 
en’ esta vy LanguageRepresentation v = 
Ontology 
Quanti 
¥ ~Motion 
Abstract ‘ 
s N > Attribute ¥ Proposition 
| 
¥ & Concept 
ime » Idea 
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5 Importance and Issues: 
This section will show the significance as well as issues regarding ontology in our verb frame. 


5.1 Role of Ontology in Bengali Verb Frame: 
Ontology is an added component in our verb frame which is otherwise built on Paninian 
Dependency Grammar. Task of ontology is to provide some other kind of information which 
cannot be obtained by structural-semantic analysis of Paninian dependency framework. 
Dependency roles show the semantic connection between the verb and the argument. On the other 
hand, ontology tells us what kind of entity the argument is in the real world. Ontological 
information is not any kind of relationship between verb and the argument, rather a property of the 
argument itself. Ontological features do not denote a linguistic entity, rather the real world object 
or concept that the linguistic item signals. In this way, ontology is language independent and 
universally applicable, at least with regard to the language. We need such language independent 
resource when we try to systematically decode the real world information packed within linguistic 
structures. Our basic idea is simple. If a given ambiguous verb has 7 different senses, we are 
supposed to get 7 different verb frames. Meaning difference must be reflected in the difference of 
verb frames. 
Consider the following examples taken from corpus, 
1. tin’ aamaake Daaklen 
he/she me+acc _ call+per 
He/she has called me 


2. svaamiike daakla _ strii 
husband+acc calltper wife 
Wife called her husband 


3. poShaa kukurTike daaklen bRRiddha 

pet dog+acc call+per old man 

The old man has called his pet dog 
The verb sense used in above example is Bengali verb Daakaa_1 ‘to call’ as per our Bengali Verb 
Frame Network, which implies - the activity of verbal calling meant to attract attention of the 
hearer. This activity needs two elements; the agent who enacts the activity of calling and a patient 
who is at the receiving end of the activity. As illustrated in the example, the agent or the first 
argument of the verb Daakaa_1 is always a human i.e. it is always humans who calls someone (at 
least in the real world scenario). The second argument or patient can be a human or an animal. 
Hence, in our verb frame we mark the first argument of the verb sense Daakaa_/ as ‘+human’ and 
the second argument as ‘+human +animal’. This way it is able to capture the generalisation 
regarding the verb sense in question. 
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VERB FRAME 


DEPENDENCY RELATION kl k2 
ARGUMENT _ TYPE NP NP 
VIBHAKTI/POST-POSITION oO ke 
ONTOLOGY +human +human 
+animal 


Verb Frame for the verb Daakaa_1 
Now, let us look at the corpus drawn sentences of another verb Daakaa_2 ‘to make a noise’ which 
implies the activity of animals making a sound in Bengali. 

4. gaadhaa theke-theke Daakchila 


donkey attimes sound+pastt+pro 
The donkey was braying 
5. kukur gheu gheu_ kare Daakche 


dog bark 

The dog is barking 
Similarly, these examples clearly show that the agent of the verb Daakaa_2 can only be an animal. 
Therefore, we have marked it as ‘+animal’ in our verb frame. 


post-position sound+pro 


VERB_FRAME 


DEPENDENCY _ RELATIONS k1 


ARGUMENT_TYPE 
VIBHAKTI/POST-POSITION 


ONTOLOGY 


NP 


oO 


+animal 


Verb Frame for the verb Daakaa_2 


In the above examples, the contextual difference between two verb senses is clearly revealed in 
the form of tabular verb frame. Machine can make use of such verb frame to identity the correct 
sense form the context. Here verb frame is different in terms of both the dependency relations as 
well as ontology. But, sometimes two senses of an ambiguous verb appear to have identical 
dependency structure. Then, ontology becomes the most important marker of contextual 
difference. Reverse is also true. In other words, ontological information compliments dependency 
information. Often, ontology presents novel aspects of the context which cannot otherwise be 
revealed. 


5.2 Issues and Limitations: 

Adam Pease in a 2011 lecture said “creating a large ontology does consist of a lot of trivial 
assertions about the world. Things that people already know. But, the fact is computers do not 
know them” (Pease, 2011). He was then taking about one of the primary criticisms directed at 
SUMO. Ontologies are usually built for machine learning purposes. The information that 
ontologies represent may not appear very unique to us, but computer needs this kind of world 
knowledge to perform NLP tasks. Our ontology is specifically developed to categorise nominal 
arguments of Bengali verbs. The objective is to arrive at some categorical generalisations 
regarding the argument themselves which is otherwise consistent with the dependency structure 
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information and compliments the same. Though our ontology performs well in its task it has some 
limitations. One of the limitations of our ontology is that it does not cover all scenarios. Some 
limitations are coming from the purpose of the ontology itself. For example, our ontology cannot 
account for a supposed fantasy fictional or mythological situation where everything, even a stone, 
can talk. We cannot include this kind of fantasy or mythology scenarios in our ontology because 
then it will be open-ended then and it will no longer be used for meaning disambiguation. For, in 
fantasy, everything is possible. We won’t be able to find necessary constrains on which we can 
base our generalisation. There is another kind of limitations which is more subtle. Our ontology 
will not be able take care of the scenario where a robot or speech generation system is talking. Our 
ontology is an upper ontology; it is not designed to take care of such specific situation like robotic 
speech. This information, if necessary, can be conveyed to the machine by other means. We can 
also develop a domain specific ontology for example, ontology of speech technology, to handle 
such scenarios. Purpose of this ontology is to formally represent the fact that when the activity of 
talking takes place in real world, it is the usually the human beings who are involved. Hence, the 
ontological information in Bengali verb frame provides categorical generalisation regarding the 
verbal arguments. But the generalisation assumes and projects real world context; it may not 
represent any specific utterance which represent a rare unusual context. 


7 Conclusion: 


The research initiative which is hereby being discussed is of applied nature. The motto of this 
initiative is to prescribe a framework aimed at resolving Verb Sense Disambiguation. As already 
discussed sense disambiguation is required for increasing the accuracy and precision of various 
NLP applications and it is most useful in machine translation. Though one of the main objectives 
of this project is to create resources for computational linguists, researchers of other areas like 
syntax, generative semantics, translation theory etc may also be benefitted from this resource. We 
believe this research can be equally beneficial for language teachers and second language learners 
of Bengali. 

Ontology is a section in our verb frame. Our Bengali Verb Frame is still under development. 
Though it is aimed at resolving verb sense disambiguation issue which is mostly a NLP problem, 
it can offer fresh insight to the theoretical linguists as well. As per my knowledge, there was no 
previous initiative to create any such resource for Bengali. Verb frame has been developed in Hindi 
and few other Indian languages. But, those resources did not try to include any ontological 
framework. In this way, it is a novel development in verb frame research. Ontology is a section in 
our verb frame. Existing ontologies are not developed for this kind usage. Hence, we had to 
drastically reduce the ontological features and modify some of the features. Our ontology has 
become a novel ontological framework most suitable for semantic analysis. Here, in our research 
project, it reveals completely different aspect of verb semantics which is clearly not accessible 
through Paninian Dependency framework. Ontology has proved to be a very valuable section in 
our research. As ontology is language independent, it may be applied to similar projects on other 
languages with or without major modification. 
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Phonological, Morphological, Telangana is one of the newly formed states in Indian, which was part 


Telangana Telugu, Coastal of the United Andhra Pradesh from 1956-2014, After a long struggle, 

Telugu. Telangana again is re-emerged as a separate state in 2014. In the 
Telangana bifurcation movement along with issues like geographical, 
economic, social, and political Telangana Telugu also have played an 
important role, because in the united Andhra Pradesh the Telangana 
language use to treated as a dialect of Coastal Telugu instead of 
language. Because of some historical, geographical, political reasons 
Telangana Telugu is shown some linguistics variation such as 
phonological, lexical, and morphological levels this has been marked 
Telangana Telugu as dialect not a language. But this 
paper does not cover all those variations but only focuses on the plural 
formation and their morphophonemic changes between Telangana 
and Coastal Telugu. This paper consists of, introduction, reviver of 
literature, method, analysis, or discussion and we will summarize and 
conclude seeking more work in the future on the Telangana Telugu. 


1. Introduction 


Telugu is one of the major literary languages of the Dravidian family of languages. It is 
not only spoken in the two states, viz. Andhra Pradesh and Telangana but also used as a second 
language in the neighboring states like Karnataka, Tamil Nadu, Maharashtra and Odissa. 
Because of many historical, socio-political and geographical reasons, we find variation between 
the Telangana and Coastal Telugu, in phonological, morphological, and lexical levels. Owing 
to the vastness of the subject, this paper concentrates only on the plural formation and its 
linguistics variation in the Telangana Telugu compared to the Coastal Telugu or Standard 
Telugu. The Plural formation is one among the important mechanisms in the noun categories, 
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which can help us know more about variation between Telangana and Coastal Telugu. Because, 
the plural formation is not simple which we add plural suffix to the noun but also we find many 
morphophonemic changes, deletion of syllables before plural suffix, deletion of consonants, 
vowel harmony etc. Due to all these which are mentioned in the above, there are many 
morphophonemic and phonological changes in the Telugu plural formation mechanism. At the 
same time, when we same plural formation in the Telangana Telugu, we find a number of 
morphophonemic alternatives between Telangana and Coastal Telugu. Before that, we must 
know the plural forms in Telugu, In Telugu, the most common plural form is —/u or -lu for 
example nakka ‘fox’ _nakkalu ‘foxes’_kannu ‘eye’ kal/u ‘eyes’ and pragatisiludu ‘progressive 
person_‘pragatisiluru ‘progressive people’ But among these three plural markers the —/u or -/u 
is occurs more frequently in the Telugu and these plural suffixes forms are added to the nouns 
or to the basic stem. There is enough work is carried out on the plural formation in Telugu, such 
as Ramaro (1970.76), Krishnamurti (1974), Krishnamurti and Gywwn (1977, 85), 
Subrahmanyam (1976, 74). and Rao (2000), etc. But this study is mostly concentrated the on 
the Coastal Telugu or standard Telugu, and here is no enough much work carried out on the 
Telangana Telugu compared to the Coastal Telugu. Whereas, this paper is mainly focuses on 
plural formation in the Telangana Telugu and their linguistic variation compare to the Coastal 
Telugu or standard Telugu. Telangana is the 29th state of India which is formed as a separate 
state in 2014. Telangana language sentiment also has played a very important role in the 
Telangana bifurcation movement as Telangana Telugu use to be treated as a dialect or a low 
language by non-academicians. That’s true that Telangana Telugu 1s shown some linguistic 
variations compare with Coastal Telugu, the main reason is that Telangana Telugu is greatly 
influenced by the different languages such as Hindi, Marathi, and Urdu and Persian_and which 
belong to different language families. There are few strong reasons for language influences on 
the Telangana Telugu, because, if we look at the geographical borders of the Telangana state, 
it shares its borders with different states, such as in the North with Chhattisgarh, in the west 
with Maharashtra and Karnataka, and in the South with Andhra Pradesh. We can say that 
Telangana looks like an open borders state which is surrounded by different states and their 
languages. Historically, the Telangana state is ruled by the Nizam dynasty for more than two 
hundred years, in their period Telugu has mostly ignored and heavily influenced by the official 
Language called Urdu, because of that reason today we find number of Urdu vocabularies in 
the Telangana Telugu. Because of these geographical and historical reasons, Telangana Telugu 
has lots of language variation and is influenced by the above languages. Due to the linguistic 
variations in phonological, lexical, and morphological levels, Telangana Telugu has been 
treated as dialect or low language before states bifurcation, but now the situation is changed 
because Telangana Telugu has declared as state official language and also it’s implementing in 
many important domains. 


1.1. Literature review 


There are no specific studies that have been carried out on the Telangana Telugu superficially 
on plural formation. However, there are a few are Telugu dialect bulletins are there that are 
published by the Telugu Academy such as, Radhakrishna (1971, 1977, 1985), Shivananda 
(1974, 86), Srihari (1986, 98), Laxminarayana (1999), Krishnamurti (2002), Bhaskar, (2003), 
Venkanna (2014), Reddy. (2018), etc. in the united Andhra Pradesh and these bulletins have 
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discussed the plural formation and variation in certain extensions, but they have not focused 
completely on the plural formation and on their morphophonemic changes. But there is an 
important work carried out on the Plural formation on the standard Telugu by Ramarao (1970, 
76),. Krishnamurti and Gywnn (1977, 85) and Rao (2000,2012), their work have enough 
discussed also categorized plural formation in Telugu into different types, however, but these 
studies are mostly applicable to the Coastal Telugu, but presently this paper mainly discusses 
plural formation in the Telangana Telugu and their morphophonemic variation compare with 
Coastal Telugu or so-called standard Telugu. 


1.2. Research Questions 


I. Telangana and Coastal Telugu varieties are belonged to two different regions and also 
the difference in terms of phonological, lexical, and morphological levels, how these 
two varieties differ in plural formation? 


2. Telugu being is Dravidian language and its plural formation is completed, which there 
many morphophonemic processes take place in the process of plural formation, then, 
how the morphophonemic processes differ in the Telangana Telugu when we compare 
with the Coastal Telugu? 


3. The plural formation is distinguish between Telangana and Coastal Telugu varieties but 
what are those nouns in the Telangana Telugu, which they show similarities and 
differences with Coastal Telugu? 


2. Article Structure 


This article is followed into four major sections, such as Method, Data Collection, Discussion, 
Conclusion, appendix, and references list. The Method is mainly focused to find out mainly on 
morphophonemic changes between the Telangana Telugu and Coastal Telugu in plural 
formation. The second part is Data Collection has discussed the process of data collection and 
from different sources, such as published sources books, novels, bulletins, and short stories, 
television, and social media shows in Telangana Telugu. The third section is the discussion, 
which is the main part of this paper, it is discussed and show analysis of the data between two 
Telangana Telugu and Coastal Telugu. . This section has taken seven sets of different nouns 
forms, based on their ending forms, and then compared with Coastal Telugu, to see the 
similarities and differences in morphophonemic levels. The fourth section summarizes the 
paper and ends with some suggestions for future research. the fifth section is the appendix 
provides a sample of data and the last section references list which I have referred to in this 
article. 


3. Method 


This paper mainly focuses on the similarities and differences in the plural formation between 
Telangana and Coastal Telugu. So, here, we have analyzed the data between these two varieties 
in terms of variation. 


3.1. Data Collection 


Data has been collected from different sources mostly published resources in the Telangana 
Telugu or Telangana dialect such books, articles, short stories, novels etc. The amount of data 


176 


is collected around 500 items from the Telangana Telugu and reference Telugu has collected 
from those difference sources available in the Coastal Telugu or standard Telugu to the 
reference purposes. The collected data has been analyzed into different levels to see variation 
and morphophonemic changes between these different varieties. 


4. Discussion 


This paper has been categorized the Plural formation into different types based on their syllable 
endings, based on the Krishnamurti and Gywwn (1976, 85) categorization. Krishnamurti and 
Gywwn (1976, 85) listed the plural forms into eight types and three subtypes based on linguistic 
behavior before plural suffix -lu or -/u suffixes. But I have not taken all the categories but 
taken few types of plural formation which they are the relevant and which they some linguistics 
variation from Coastal Telugu or Standard Telugu, for examples. 


Set-1 


appended with the plural marker —lu which does not undergo any change further in terms of 
vowel harmony or retroflexion of —/u 


| 2. | pilla | pilla-lu | pilla-w _| Children _| 
| 3. | reku | rekulu | reku-lu | Tins 
| 4. | ropfe | ropfe-lu | roffe-lu __| Breads__| 
| 6. | phoso_| phojolu__| phojolu__| Photos _| 
ea re € 


Oo 


S.N Basic | Pl form Pl form Gloss 
Stem in TTe in CTe 
2: 
3 / / 
4 


kukka-lu_| kukka-lu 





ceMca-lu_| ceMca-lu Spoons 


Set-2 
When a noun ends with —ru or -rre and rru, the vowel of the final syllable gets deleted in the 
Telangana Telugu before the plural marker-lu. Then the —r+/u and rr+lu get metathesized to- 
lru and -lru. However, there remain certain nouns ending in —rre, do not allow the metathesis 
in their plural formation. This can be explained by the frequency of usage. For example: 

Stem | inTTe in CTe 


barlu/lru 
vertu tru lu 
kittur-lru/tlu 





Certain nouns that end with —rre as in purre, ‘skull’ varre ‘canal’ and gorru ‘a kind of plough’ 
do not delete the final vowel and do not undergo the metathesis. 
Set-3 
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When stems ending in -ndi, ndu, nfi, nfu,di, and du occur the final vowel is deleted before the 
plural suffix forming -n//- and -ndl- clusters. Alternatively the retroflex stop (t, d) is elided 
forming nasal plus lateral clusters in the Telangana Telugu as in the following: 


Stem in TTe in CTe 


pandlu/ntu/llu_| pandlu/nlu 
badlu/llu 
doddlu doddlu 





Set-4 
When nouns ends with di, du, li and ru, syllable then the final di, du, li and -ru get absorbed 
into the lateral giving rise to geminate laterals as in //u. 


S.No | Basic Pl form Pl form Gloss 
BN [Sem [intre [incre [on 
e 
tammullu 
tall 
| 6. kau kill i elu [Legs 
| 7. | kandili__|kandillu __|-- | lamps 
| 8. | akili_—— thi _| vdkillu _| Door ways | 





Set-5 

In the Coastal Telugu if a noun ends with ru, du, ndu syllables, and then these syllables endings 
nouns are takes -//u as their plural suffixes, but in the case of the Telangana Telugu, the same 
syllable ending nouns are takes an alternative plural suffix which is called—//u. Along with that 
we find number of morphophonemic changes occurs in the Telangana Telugu compared with 
the Coastal Telugu, because in the Telangana Telugu these nouns are ends —/lu, dlu, or nlu 


variations. 
S.No | Basic Pl form Pl form Gloss 
Stem in TTe in CTe 
5. 
7. 
10 


tach 


cigurlu/llu 
edurlu.llu 
bicc 


pamelu/llu/nlu 
punelu/lu/ntu 





| 5. 

| 6. 

| 7. | magac 

| 8.| biccagadu | biccagallwdlu | iccagélu_| Beggars _ 
| 9. | pane 

| 10) 


Set-6 
Generally in the Telugu, when we refer to the group of people or honorific there we use two 
different marker—ru and -//u, such as mancivaru or mancivdllu “good person or good people’ 
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but here —ru can indicate singular honorific and -//u can indicate as plural or singular honorific 
and plural form in general. But in the Telangana Telugu, the singular honorific marker —rudoes 
not find, whereas, we find —//u suffix to indicate the singular or honorific and plural marker for 
example miru/millumancollu (you people are good), etc. But the basic differences between 
Telangana Telugu and Coastal Telugu is that When this kind of nouns are followed the plural 
suffixes forms —//u or —//u then nouns final vowel -i is get deleted or noun final syllable is va 
— 6 before —Ilu suffix in the Telangana Telugu for examples, kKammari+vda/lu — kammarollu 
‘blacksmith caste people’. Seethe following examples. 


Stem in TTe in CTe 
komagi- véillu 


| 6. | manci‘vallu | mancollu | ancivajlu _|Good people 
Women 

| 8. | fella valu ellavaiflu 
| 9. | mana-vaflu | mandlu__| ana-walu_|Ourpeople 





Set —7 

Most of the geminate or double consonants syllable—//i or single syllable like —/i ending nouns 
generally, those nouns take -//u as plural suffix in the Coastal Telugu but the same syllable 
ending nouns in the Telangana Telugu has an alternative plural suffix —//u instead of -//u suffix 
form or the retroflex -//u is palatal —//u in the Telangana Telugu before the plural suffix —lu 


Stem in TTe in CTe 
5: j 


naganellu/lu 
némandlu/llu 
| 5. pelli___| pelindlu/llu__| pelliflu 





After looking at plural formation between the Telangana Telugu and Coastal Telugu, these two 
varieties are linguistically differed in the Plural formation to a great extent and especially differs 
in the morphophonemic changes, vowel harmony, deletion of syllables, etc. the Basic 
differences between Telangana and Coastal Telugu is that Telangana Telugu uses —/u as plural 
suffix instead of -/u where is normally, and also the Telangana Telugu shows a number of 
variations in morphophonemic levels, where it is inserted new phonemes before the plural suffix 
like dlu, ndlu, nlu which we do not find such insertion in the Coastal Telugu. Another most 
important characteristic of plural formation in the Telangana Telugu is that the retroflex -lu is 
always get changed as —lu, because of all the above reasons, we find variation in the plural 
formation in the Telangana Telugu compared with Coastal Telugu and it is one feature which 
shows that, Telangana and Coastal Telugu some linguistic variations. 
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6. Conclusion 

This paper has discussed the plural formation between Telangana and Coastal Telugu, which 
more focused variations in plural formation and their morphophonemic changes between 
Telangana Telugu and Coastal Telugu. Telangana Telugu has many linguistic variations 
compared with the Coastal Telugu which makes the Telangana Telugu varieties from the 
Coastal Telugu in terms of phonological, lexical, and morphological levels, Because of this 
linguistic variation, Telangana Telugu has been treated as a dialect of Telugu or Coastal Telugu, 
but which modern linguistic does not accept. But this paper has tried one part of linguistic 
variation is plural formation which is shown morphophonemic and plural formation between 
Telangana and Coastal Telugu. Because of the many geographical, historical and political 
reasons, Telangana Telugu is differs for many aspects, so we need a detailed study to know 
more and explore the Telangana Telugu in different aspect sand which will help to implement 
the Telangana Telugu in different domains, such as design syllable in educational purposes, 
implement in the government and media and etc. 
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Appendix 
This data has been taken from the different published works such as published bulletins, 
published novels in Telangana Telugu, short stories, social media such as online Youtube online 
channels, and others sources available. I have collected around 500 items, but just for sample 
purposes, I have given around 50 items. 


Root PlinTTe PICTe Gloss 
kukka kukkalu kukkalu Dogs 
pilla pillalu pillalu Children 
réku rékulu rékulu Tins 

rotte rottelu rottelu Breads 
pallt palltlu palltlu Groundnuts 
photo photolu photolu Photos 
laddti laddtlu/ ladlu laddilu/ Sweet ball 
ceMca ceMcalu ceMcalu Spoons 
kddalu kodallu kddallu daughter-in-laws 
gorre gorlu/lru gorrelu Sheep 
barre barlu/Iru barrelu Buffalos 
pudla pudkalu pudkalu Sticks 
véru vérlu/Iru/llu —-vérlu Roots 
kituru koturlru/llu — kitullu Daughters 
bandi bandlu/nlu/Ilu bandlu/ ballu Bikes 
kalu kallu kallu Legs 

uru drlu /dllu arlu Villages 
pandu pandlu/nlu/Ilu pandlu Fruits 
doddi dodlu doddlu Toilets 
badi bandlu /Ilu ~— badulu/ ballu Schools 
gudi gullu gullu Temples 
tammudu tammullu tammullu Brothers 
tadu tallu tallu Ropes 
kalu kallu kallu Legs 

goru gorlu/gdllu — gdllu Nails 

noru norlu/llu norlu Mouths 


ciguru cigurlu/Ilu —cigullu Sprouts 


veduru 
gudu 
magadu 
biccagadu 
ceyi 
pandu 
pundu 
mancivallu 
adavallu 
tellavallu 
manavallu 
mi vallu 
kaikili 
goddali 
nagali 
raitu 
némali 
pelli 
kannu 


celle 
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vedurlu.llu = vedullu 
gullu gullu 
magallu magallu 


biccagallu/dlu biiccagalu 
cetulu/ceyilu cetulu 
pamdlu/Ilu/nlu pallu 
pundlu/Ilu/nlu pullu 


mancollu mancivallu 
adollu adavallu 
tellollu ellavallu 
manollu anawallu 
mi Ollu mi vallu 
kaikillu panulu 


goddandlu/llu goddallu 
nagandlu/llu  nagallu 
raitulu raithulu 
némandlu/llu némallu 
pelindlu/Ilu 


kandlu /kanlu kallu 


pellillu 


celendlu/cellelu cellellu 


Bamboos 
Nests 

Men 
Beggars 
Hands 
Teeth 

Sores 

Good people 
Women 
Britishers 
Our people 
Your people 
Wages 
Axes 

Plows 
Farmers 
Peacocks 
Marriages 
Eyes 


Sisters 
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and the poor and deprived had to remain content with the education in 
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Gradually it left its impact in various ways, for some very productive and for 
many disastrous. After a few decades Macaulay’s Minute came into effect, a 
division emerged within the Indian students; one those who were able to 
understand the English language and second were those who could not cope 
up with the expectations as laid out by the language. 1854 Wood’s Despatch 
realized this emerging gap and recommended that primary schools must 
adopt vernacular languages, high schools Anglo-vernacular language and 
English should be the language of instruction at college level. This 
distinction still persists in our existing education system and has impacted it 
quite powerfully; more precisely at higher education level. This led to a ditch 
within the Indian nationals; one those who managed to receive the education 
in English and second those who could not. Since then this ditch has ever 
been growing and getting more and more intensified. Still there are many 
people in India who find English language difficult to understand which is 
increasingly intensifying the consciousness of “English Speaking People’ 
and ‘Non-English Speaking People’ within the natives. 

The present paper is being written with the purpose to trace the creation and 
then the development of this ‘ditch’ since the inception of the educational 
model founded upon Macau-lean ideals, and what difficulties people have to 
go through after receiving an education at the school level through their 
respective vernacular language with acquiring the broken knowledge of 
English and then to aspire for higher education for which the only portal is 
English language. 
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1. INTRODUCTION 


The question of language in the education system of India has been a matter of much debate since 
the colonial times. It has played a very crucial role in structuring the educational policies of the 
regimes. From time to time it was contested and debated by the various interested parties whom 
the hegemonic policies of the day affected. Some of the debates related to the issue of language of 
education are still relevant, even more than two and half centuries after they were surfaced for the 
first time on the Indian subcontinent by the East India Company. 

The problem of the medium of instruction and the subject matter (Western knowledge or the 
prevalent oriental learning during that time) of teaching to the Indian nationals became a matter of 
hot debate only after Macaulay presented his Minute on Education (1835) before the British 
Parliament and East India Company that brought the sweeping changes in the education of India 
and successfully replaced the oriental learning with the westernized version of learning. Macaulay 
quite enthusiastically valorized the western knowledge and English language in his Minute that 
English came to be advocated as the medium of instruction for this task because imparting a 
westernized version of education through any of the Indian or oriental languages would have been 
very challenging, and at the same time absolutely fruitless in Macaulay’s words. Therefore, 
English became the language of instruction at the higher level education with various other Indian 
vernacular languages at school level. The demand for English was heavily noticed from the elites 
and rich, and the poor and deprived had to remain content with the education in vernacular 
language or with no formal education at all. 

Gradually it left its impact in various ways, for some very productive and for many disastrous. 
After a few decades Macaulay’s Minute came into effect, a division emerged within the Indian 
students; one those who were able to understand the English language and second were those who 
could not cope up with the expectations as laid out by the language. 1854 Wood’s Despatch 
realized this emerging gap and recommended that primary schools must adopt vernacular 
languages, high schools Anglo-vernacular language and English should be the language of 
instruction at college level. This distinction still persists in our existing education system and has 
impacted it quite powerfully; more precisely at higher education level. This led to a ‘ditch’ within 
the Indian nationals; one those who managed to receive the education in English and second those 
who could not. Since then this ‘ditch’ has ever been growing and getting more and more 
intensified. Still there are many people in India who find English language difficult to understand 
which is increasingly intensifying the consciousness of “English Speaking People’ and ‘Non- 
English Speaking People’ within the natives, and the Indian nationals are still trying to cope up 
with the challenge posed by the British administrators to learn English language and the western 
ideals of education. 


1.1 LITERATURE REVIEW 

This paper attempts to provide the historical perspective about the arrival and then the development 
of English education in India and what challenges people had to face and are still facing as a 
consequence of that. For the purpose of this paper, an extensive, analytical and comparative study 
has been done of the literature based on the historical background of English language and English 
language teaching in India. 


1.2 RESEARCH QUESTION 
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The purpose of this paper would be to trace the creation and then the development of this ‘ditch’ 
since the inception of the educational model founded upon ‘Macau-lean’ ideals, and what 
difficulties people have to go through after receiving an education at the school level through their 
respective vernacular language with acquiring the broken knowledge of English and then to aspire 
for higher education for which the only portal is English language. 


2. METHOD 


The selected literature was taken from libraries, internet and Google scholar, and was speculatively 
debated and discussed with my peers, and then interpreted in the following manner as discussed 
in the ‘Result and Discussion’ section to discuss the stated idea in the ‘Research Question’ sub- 
section. 


3. RESULT AND DISCUSSION 


3.1 INTRODUCTION OF ENGLISH LANGUAGE INTO INDIAN EDUCATION SYSTEM 


This sub-section will attempt to throw some light at the circumstances that led East Indian 
Company to introduce English language into the Indian education system. 

It was only in the end of eighteenth century that East India Company felt compelled to think over 
the education of the natives of India in order to civilize them. It was regarded that to trade with the 
civilized people would be more beneficial and profitable for the company than to trade with 
uncivilized and uneducated savages of India. Missionaries came in huge number from England 
with the aim of proselytization and impart an education in English to the Indian mass laden with 
Western morals, thoughts, knowledge of science, philosophy and more importantly Christian 
religion with its beliefs. 

Charles Grant, a director of East India Company, came to India in 1767 under whose wings 
missionaries found the support for their purpose. In 1792 Grant prepared the first blue print on 
Indian education and in the same year he wrote a book titled Observations on the State of the 
Society among the Asiatic Subjects of Great Britain, particularly with respect to Morals and the 
Means of Improving it with the suggestion of a policy to bring about moral, social and mental 
changes in Indian society through the English language, Western education and Christianity. Grant 
recommended the introduction of English language as the medium of instruction and a Western 
system of education that included literature, natural sciences, mechanical inventions etcetera, to 
remove the superstitious beliefs prevalent among the heathens of Indian; and the adaptation of 
English as the official language of the Company and the Government for easy communication 
between the rulers and the ruled. (Krishnaswamy & Krishnaswami, 2006) 

Grant with Zachery Macaulay (father of Thomas Babington Macaulay), William Wilberforce and 
a few others relentlessly wanted a favor from the British government for an education in English. 
He believed that the Christian faith with the medium of English would abolish the evils in Hindu 
society and liberate the minds of Hindus. He stated that, “Whenever this knowledge would be 
received, idolatry, with all the rabble of its impure deities, its monsters of wood and stones, its 
false principles and corrupt practices, its delusive hopes and vain fears, its ridiculous ceremonies 
and degrading superstitions, its lying legends and fraudulent impositions would fall” 
(Krishnaswamy & Krishnaswami, 2006). In 1793, Wilberforce moved a resolution before the 
British parliament supporting the Anglical education. 
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But the British parliament rejected the idea of Anglicist learning and considered better to follow 
the policy of Warren Hastings (Governor of India during 1772-85) who respected Indian traditions 
and supported the education of natives through the Oriental and Indian classical languages. In 1781 
Hastings founded the Calcutta Madrassa and in 1791 the Benares Sanskrit College for both Hindus 
and Muslims to encourage Oriental learning. The Government of British India marked this policy 
as Oriental School of Education. 

The reason of the rejection of Anglicist learning was that the British did not want to face any 
cultural confrontation from Indians; and thought that support to missionaries in India would be, in 
a larger sense, counterproductive and it was better to follow the policy of non-interference. 
Favoring this aspect, a member of British Parliament said: “We have lost our colonies in America 
by imparting our education there; we need not do so in India too” (Mukherji, 1951). 

The missionaries led by Charles Grant did not like it and criticized harshly the policy of Oriental 
education supported by British government in India. In 1813, the Charter of the Company was 
renewed and the missionaries were allowed to carry on with their activities of preaching Gospel 
and Christianity through English language. This came to happen because Charles Grant was the 
Chairman of East India Company in 1807, Deputy Chairman in 1808 and Chairman again in 1809; 
and the policy of the Company was reviewed vigorously under his surveillance. The question of 
the language of education was discussed ambiguously in the resolution of 1813. The resolution 
made no mention of the language of education and that only stated that the surplus was to be 
allotted to the ‘improvement of literature and encouragement of the learned natives of India’ which 
further complicated the issue between the supporters of Oriental and Anglicist education 
(Krishnaswamy & Krishnaswami, 2006). The Charter Act of 1813 also conveyed that the 
government was to spend one lack of rupees every year on Indian education (Krishnaswamy & 
Krishnaswami, 2006). 

In 1823 a General Committee of Public Instruction was selected consisting ten members having 
the supporters of both Anglicist and Oriental education. Between 1823 and 1833, the Calcutta 
Madrassa and Benares Sanskrit College were given the government recognition by the Committee. 
Sanskrit college at Poona in 1821; and two more Oriental Colleges at Agra in 1823 and Calcutta 
in 1824 were also started. The Committee took the charge of the printing of Sanskrit and Arabic 
books; and employed Oriental scholars to translate books from English into classical Indian 
languages. The intention of the Committee in this was to gain a favorable impression from the 
people of India, especially the learned ones. 

In accordance with the Oriental learning, a number of Christian schools and colleges were also 
established by the Anglicists in various parts of India. The Baptist Mission schools (1815), the 
Serampore College (1818), the London Mission Society’s Schools (1818), the Bishop’s College at 
Sibpur (1820), the Calcutta School Society’s schools (1819), the Jaya Narayan Ghoshal’s English 
School at Benares (1818), and the General Assembly’s Institution (1830) were founded imparting 
an education to Indians in order to convert them to a new language, new religion and new culture 
(Krishnaswamy & Krishnaswami, 2006). In their efforts, Missionaries were successfully in 
creating a demand for English education among the Indian natives who were under their learning 
process and the General Committee of Public Instruction had to introduce English classes in 
Calcutta Madrassa, the Benares Sanskrit College, Delhi College, Agra College and other Oriental 
institutions. 

Later on, British noticed the increasing demand of English education from the Indian natives itself. 
Some great Indian scholars, including Raja Ram Mohan Roy, demanded for an education in 
English from the British government. Raja Ram Mohan Roy wrote a letter in December, 1823 to 
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Lord Amherst concerning about the idea of establishing a Sanskrit college in Calcutta by the 
Committee of Public Instruction. He wrote: 

“We now find that the government is establishing a Sanscrit school under Hindu pundits to impart 
such knowledge as 1s already current in India... The pupils will there acquire what was known two 
thousand years ago with the addition of vain and empty subtleties... such as is already commonly 
taught in parts of India” (Krishnaswamy & Krishnaswami, 2006). 

The intention of Raja Ram Mohan Roy behind asking for English education was to make-up the 
intellectual aspect of Indians and to rejuvenate ancient learning by fledging it with the modern 
knowledge of the West. But this demand was very cunningly turned into a demand for English as 
the language of education, the medium of instruction and the dissemination of Western religion, 
morals, thoughts and values in the mindset of Indians by the British Anglicist supporters. 

Then it was the arrival of William Bentinck who became the Governor General of India in 1828. 
William Bentinck was a friend of Charles Grant and he also wanted Anglicist education pattern in 
India. Bentinck wrote a letter to the Committee of Public Instruction asking to make English the 
official language of the Government and the language of education. On 29" September, 1830 
Bentinck received a letter from the Court of Directors allowing him to introduce English as the 
language of public business in all its departments (Mukherji, 1951). 

It was June of 1834 when Thomas Babington Macaulay came to India and became the first member 
of the Governor General’s Executive Council. He was appointed as the President of the General 
Committee of Public Instruction. Lord Macaulay teamed up with Charles Grant Jr (son of Charles 
Grant) to bring English language and English education system in India imparting the Western 
knowledge, culture and religion. Lord Macaulay’s brother-in-law, Charles Trevelyan, also became 
the member of the committee and Bentinck regarded both of them in high esteem. The debate 
between the Orientalists and Anglicists seemed to have no end and various suggestions came up 
from both the fractions in the Committee favoring their own respective sides. 

On 2™ February, 1835 the famous Minute on Education by Lord Macaulay came out in which he 
talked about bring sweeping changes in the education system of India and criticized too harshly 
the centuries old and long traditional and cultural education system of India, being imparted orally 
from generation to generation in the indigenous classical languages — Sanskrit, Urdu and Persian. 
Macaulay very clearly and unambiguously stated in his Minute about the objective of having 
English education in India: 

“We must at present do our best to form a class who may be interpreters between us and the 
millions whom we govern — a class of persons, Indian in blood and color, but English in taste, in 
opinion, in morals, and in intellect.” (Macaulay, 1835) 

Macaulay bluntly disregarded the Oriental learning and considered it to be useless. He argued for 
to stop the expenditure that was being spent on the printing of Arabic and Sanskrit books since he 
considered that “a single shelf of a good European library was worth the whole native literature of 
India and Arabia”.(Macaulay, 1835) He desired to divert those funds towards the English 
education and asserted: 

“But I would strike at the root of the bad system which has hitherto been fostered by us. I would 
at once stop the printing of Arabic and Sanscrit books, I would abolish the Madrassa and the 
Sanscrit College at Calcutta.” (Macaulay, 1835) 

Macaulay threatened to resign from his post as the Chairman of the Committee if his Minute is 
rejected. But the Governor General of India, Lord William Bentinck, wanted the same and without 
any hesitation and a moment of thought accepted Macaulay’ Minute on 7 March, 1835. 
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In 1837 English was made as the official language of education and Government installed a policy 
to establish English schools or Anglo-vernacular schools in every district of India. These schools 
used vernacular languages besides English and those schools which were considered Good Schools 
were given the status of college. There were missionary schools also that taught English language. 
By 1845 there were around 30,000 missionary schools according to the figures presented to the 
House of Commons in the same year (Krishnaswamy & Krishnaswami, 2006). 


3.2 ENGLISH BECOMING AN INTEGRAL PART OF INDIAN EDUCATION SYSTEM AND THE 
EMERGENCE OF THE ‘DITCH’ 


By 1850 English language got hold of very strong roots in India. It was not only the East India 
Company that wanted English education for the Indian national, but also an increasing demand 
was being experienced from some of the Indian nationals who were becoming keen to learn 
English language, especially the Brahmins and some from middle classes. Bengal and Madras 
were the states where a great demand of English was noticed. In 1854, the Hindu College in 
Calcutta was taken over by the Government and renamed as Presidency College. All this 
strengthened the scope for English education in India. 

In 1853 the renewal of the Charter came up before the Parliament and found that after Macaulay’s 
Minute the changes that were brought in the educational system of India created a ditch between 
the people and the government which spread the feeling of alienation among the masses. Charles 
Wood, also known as Lord Halfax, was the President of the Control of East India Company and 
prepared a Despatch in 1854 conveying that the “English language would be taught where there 
was a demand for it, and that it also would be combined with a careful attention to the study of the 
vernacular language of the district” and the “subject of Christian religion would be provided out 
of school hours and only if asked for voluntarily” (Krishnaswamy & Krishnaswami, 2006). 

The Despatch by Wood had a very different, convincing and soft tone than that of Macaulay’s 
Minute. But this Despatch also created a dichotomy of giving one kind of education to the classes 
and one kind of education to the masses. Education in English was expensive and only financially 
strong people and the upper classes were capable of affording this education. Hence the demand 
only came from the upper classes and the poor had to remain contented with the education in 
vernaculars. Therefore Wood’s Despatch actually all the more intensified the creation of the ‘ditch’ 
that is the subject matter of this paper. 

In 1857, which was also the year of ‘Great Mutiny’, three universities were established by the 
British in the Presidency cities of Bombay, Calcutta and Madras. Lahore University in 1869, 
Punjab University in 1882 and Allahabad University in 1887 were also started by the British 
government where education was imparted through the medium of English (P52-54, 
Krishnaswamy). English was the language of business and administration; and people in heavy 
number moved to learn this language in order to secure government posts in the British 
Government administrative departments which gave a serious impact on the education being 
imparted in vernaculars. 

After the revolt of 1857, the nationalist movements initiated the process of strengthening their 
roots against the British Empire. The nationalists realized that the motif of British in introducing 
English as the language of education and the medium of instruction was to sow the Western ideas, 
morals and values in the minds of Indians; and to fulfill their materialist and commercial means. 
As the nationalist resistance got a strong hold in the second half of the nineteenth century, the 
nationalists started to demand for an education in the vernacular. During the end of the nineteenth 
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century and the beginning of the twentieth century, Lord Curzon’s policy came out and formed a 
favor with the nationalist movement which led to the changes in the medium of education and 
allowed English only be taught to those who were qualified to learn it. 
Lord Curzon became the Viceroy of India in 1898 and realized that the English educated Indians 
had started to revolt against the British Empire with the very Western ideas like liberty, equality 
and fraternity that had been given to them by the British. In his speeches Lord Curzon started 
addressing to the issues related to the educational language of Indian that emerged after 
Macaulay’s Minute. He stated: 
“Ever since the cold breath of Macaulay’s rhetoric passed over the field of Indian languages and 
Indian textbooks, the elementary education of the people in their own tongue has shriveled up and 
pinned. 
The main obstacle which primary education has to contend with springs from the people 
themselves. As they rise in the social scale, they wish their children to learn English. 
By all means, let English be taught to those who are qualified to learn it, but let it rest upon a solid 
foundation of the indigenous languages, for no people will ever use another tongue with advantage 
that cannot first use its own with ease. 
Unless a good training in the Vernaculars is given in the schools, no effort of the university will 
avail” (Pattanayak, 1990). 
In 1902, the Indian Universities Commission was appointed whose work was to draft proposals to 
improve the working of universities in India and also to recommend the measures and suggestions 
how to raise the standard of universities education. The Commission was successful in 
recommending measures to supervise and control universities and colleges but did not suggest any 
measures to improve the quality of higher education. 
The nationalists’ demand for an education in vernacular could not go unnoticed any longer. The 
Resolution on the Educational Policy of the Government of India was passed in 1913. This 
resolution helped in getting recognition for Indian primary education and the Government was 
made to promise to arrange funds for primary level education. Its recommendation was to start 
primary schools at proper places and during vacations teacher training programs were to be started 
for teachers of primary schools. A number of recommendations were also made regarding the 
betterment and improvement of the educational curriculum and the mode of examination at all 
levels. The resolution was also meant to provide the expansion for university education and 
recommended the starting of fifteen more universities. There were 5 universities and 185 colleges 
in existence at the time of the resolution was passed, and that number was considered insufficient. 
By 1946-47 the number of universities was increased up to 19; and the number of colleges was 
231 in 1921 and 385 in 1939. (Krishnaswamy & Krishnaswami, 2006) 
This resolution became a very significant pronouncement and later on led to the establishment of 
vernacular schools from primary to the secondary level. It was observed: “There is much 
experience to the effect that scholars who have been through a complete vernacular course are 
exceptionally efficient mentally” (Pattanayak, 1990). In 1915 the question of introducing Indian 
vernaculars as the medium of instruction at the secondary stage was raised before the Imperial 
Legislative Council and the usual arguments given in response to this question were such as: 

1. Suitable textbooks were not available in the Indian languages; there was no technical 

vocabulary in subjects like science, mathematics etcetera in the vernaculars. 
2. There were no suitable teachers for teaching technical subjects in the vernaculars. 
3. There was no demand for vernacular education. 
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4. There was not enough money; it would be very expensive to introduce Indian languages 
as the media of instruction. 
5. The exposure to English would be reduced. (Krishnaswamy & Krishnaswami, 2006) 


Hence, English continued to be the medium of instruction and examination in most schools and 
colleges; and government refused to make any changes in the policy. 
In January 1917 a conference of the Directors of Public Instruction was convened by the 
Government of India at Delhi and another meeting of the representatives of local government at 
Simla in August 1917. The main concern of both the conferences was about the question of 
introducing the vernacular as the medium of instruction at the high school level. It was decided 
that the study of English as a language should be introduced early and it should be adopted 
gradually as the medium of instruction in the highest two, three or four classes of the high school. 
The students were allowed the option to answer their examination at the end of the high school 
course in English or vernaculars in all subjects except English; and English still continued to be 
the medium of instruction as well as of examinations. (Krishnaswamy & Krishnaswami, 2006) 
In 1913 resolution it was recommended that the necessary steps were to be taken for the 
improvement of the educational curriculum and the mode of examination at all levels. In 
accordance with the 1913 resolution, a committee was to appoint to look into and study the 
problems of Calcutta University that were springing up. This commission was set up under the 
Chairmanship of Dr M. E. Sadler and called by the name of Calcutta University Commission. Like 
all other commissions, the Calcutta University Commission also stressed at the point of the 
importance of mother tongue in education. The Commission inquired into the affairs of the 
Calcutta University and speculated deeply over the crucial problems regarding the education of 
the natives. It observed: 
“We are emphatically of the opinion that there is something unsound in a system of education 
which leaves a young man, at the conclusion of his course, unable to speak or write his own mother 
tongue fluently and correctly. It is thus beyond controversy that a systematic effort must henceforth 
be made to promote the serious study of the vernaculars in secondary schools, intermediate 
colleges and in the university” (Pattanayak, 1990). 
The report submitted by the Calcutta University Commission reflected the serious and negative 
impacts of the policy of British government to carry on with English as the language of the medium 
of instruction and examinations along with the allowance of writing examination in the vernaculars 
as an option. British had to make amendments in their policy of education to teach the native 
masses in order to make them efficient and proficient in their respective selected courses. This 
resulted in restricting the medium of English only to the colleges and university; and the vernacular 
languages were started to be taught from primary level to the secondary level in schools from 
1920s onwards throughout the country. From this point onwards, English was beginning to be the 
language of higher education and vernacular languages to be the languages at school level which 
is still prevalent in the existing education system of India. 
By the time India reached to its independence, the question of national language and medium of 
instruction became a hot debate. Dr. B. R. Ambedkar was of the view of ‘one language, one nation’ 
and considered that only then the whole India would become a whole unit or a nation. Mahatma 
Gandhi was also of the same view and preferred Hindustani in Devnagari script, which is a mixture 
of both Hindi and Urdu, as the national language. He was unambiguously opposed to have English 
as the official or national language of the country. Gandhi said, “only that language which the 
people of a country will themselves adopt can become the national language. However, virile 
English may be, it can never become the language of the masses.” (Gandhi, 1946). Gandhi’s 
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opposition to English was mainly because it was the portrayal of the sign of psychological conquest 
that the British had made over Indians and considered the mental liberation was more important 
than the physical liberation. Despite Gandhi’s emphasis on Hindustani as the national language, 
no decision was implemented concerning the language of nation or the medium of instruction at 
the time of the framing up of the Indian constitution. 

During the Constituent Assembly Debates on language, various other arguments also emerged 
favoring the cause of Hindi as the national and educational language of the country. On 26" 
November 1949, Algu Raj Shastri of the United Provinces referred to Hindi as the national 
language of India and asked the President of the CA, Rajendra Prasad, to have the Constitution 
passed in Hindi. He said: 

“This language (English) is not the language of the people, it is not the language of the common 
man. I, therefore, request you in the name of Indian nationalism, and in the name of Indian people 
to make a definite announcement in this respect” (Agnihotri, 2015). 

On 10 December 1946, R V Dhulekar made the following statement during the Constituent 
Assembly Debates on language: 

“People who do not know Hindustani have no right to stay in India. People who are present in this 
House to fashion out a Constitution for India and do not know Hindustani, are not worthy to be 
the members of this Assembly. They had better leave” (Agnihotri, 2015). 

In 1950, the Constitution of India was adopted which recognized 15 languages as the languages of 
India included in Schedule VII (14 modern languages and Sanskrit, a classical language). English 
continued as an associate official language along with Hindi for the first fifteen years after the 
Constitution was incepted. The University Education Commission Report of 1949 suggested, 
among other things, that ‘English be replaced, as early as practicable, by an Indian language as the 
medium of instruction of higher education.’ From 1964-66, The Education Commission discussed 
the medium of instruction question and proposed that the mother tongue/regional language should 
be used up to the highest level for instruction and examination, but English should be taught both 
as a subject and as a library language at higher levels (Pattanayak, 1990). Later in 1965, English 
was assured the status of the Associate Official Language of the nation. The Constitution provides 
that English will be the language of the High Courts, Supreme Courts and the Acts of Parliament. 


3.3 POST-INDEPENDENCE SCENARIO OF ENGLISH LANGUAGE IN INDIA 


After independence, English had become strongly connected to India and the Indians that finding 
any language other than English became almost impossible that could serve as a connecting link 
within the various states of India with various languages with their own dialectical varieties, 
cultural background and sentimental connection to their respective languages. It was only English 
that managed to serve the purpose that was the need of the time and it was definitely not devoid of 
its implications. 

The ‘ditch’ was still deepening and being realized at the same time. For those who could not 
manage to learn and had difficulty in acquiring English language, steps were taken to deal with 
that. Ministry of Education, Government of India in 1967 emphasized over the introduction of 
“Compulsory English” subject at undergraduate level to cope with the posed challenge of the 
language of education at higher education level and stated “Compulsory English should be taught, 
not merely in the first and second year of the three year degree course but for all the three years” 
(Loomba, 1993). The Ministry of Education gave the reason for this change was that “this makes 
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for better comprehension of written materials in English — a skill which is essential as ever at the 
post-graduate stage” (Loomba, 1993). 
Introduction of “Compulsory English” clearly indicates to the fact that how much important 
English language had become for the education of the whole nation, and that this course had to be 
taught for all the three years at the undergraduate level. There was also an implied understanding 
with the introduction of “Compulsory English” that it was the only portal to have access to the 
higher level of knowledge. The great literature and scientific works of the West could not possibly 
have been translated into any of the Indian languages, and for the scientific development of the 
country English had to be learnt. It had been argued that “students needed to equip themselves 
with enough competence in English to acquire advanced knowledge. It is perhaps for this reason 
that Compulsory English has become the most widely taught undergraduate course in India” 
(Dash, 2009) 
In 1968, the editors of the book An English Anthology for College Classes had written in their 
‘Foreword’ about the state of students at acquiring English language and how they had come to 
realize about the emerging “enormous gap between how much a student who sought admission to 
a college really knew and how much he ought to have known.” Further they argued that “It would 
not help matters if we merely sat back and blamed the secondary schools. We, therefore, decided 
to do whatever we could to improve the condition of English teaching at this level” (An English 
Anthology, 1968) 
Then in 1968 the three language formula by the Ministry of Education, Government of India was 
brought into the context to deal with the emerging crisis of the language of education. Through 
this formula, it was expected to provide the students with the knowledge and aptitude for 
multilingual education. As per this language formula, the Indian states were required to impart the 
knowledge of three languages which are as follow: 

1. For the first language, the mother tongue or the regional language to be learnt. 


2. For the second language, in Hindi speaking states English or a modern Indian language to 
be learnt and in non-Hindi speaking states Hindi or English to be learnt as the second 
language. 

3. For the third language, in Hindi speaking states English or a modern Indian language not 
studied as the second language to be learnt and in non-Hindi speaking states English or any 
modern Indian language not studied as the second language to be learnt. 


But the formula did not work as productively as it had been expected. This only increased the 
cognitive load on the part of student which complicated the situation further for the learner, and 
the learner struggled to acquire mastery even over one particular language instead of becoming 
efficiently multilingual, and proficiency in at least one language is direly needed for the cognitive, 
academic and scientific growth of the learner. 


As the formula was brought into practice, it led to various other complications. The Hindi-speaking 
states showed less interest in learning an additional modern Indian language (a southern Indian 
language) and the non-Hindi-speaking states resisted the study of Hindi language. Even the 
implementation of the three-language formula was not in accordance with its expectations. 
According to the three language formula, second and third language have to be a modern Indian 
language i.e. a Southern Indian language, but in the Hindi speaking states it is actually Sanskrit, a 
classical language, which does not find place in the three language formula. Subsequently 
controversies emerged in favor and against including the classical language in the formula. In over 
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all practice the three language formula lacked the kind of motivation required to be provided to 
the states and the students to actively participate and put in practice this language formula. 


The disagreement within the states about the selection of the language, high rate of drop outs, 
inadequate facilitation for the proper educational environment, improper infrastructure and the 
unreceptive attitude towards the modern education turned the scenario all the more worse. Due to 
all these emerging complications, the students could not manage to learn and use one particular 
language effectively. Those who managed to get through the school level education with the 
vernacular language and little knowledge of English; they found themselves in a great trouble 
when confronted with the higher education which was only possible through English. Many 
students could not efficiently and productively use any of the languages, they had learnt through 
the three language formula, for higher education purpose as they lacked the proficiency that was 
needed, and the knowledge they had acquired of English language was broken which left such 
students hanging in this ‘ditch’. 


4. CONCLUSION 


The problem of learning English language in India has become all the more intensified since the 
globalization has started influencing all walks of life in the modern world. Learning this language 
has become an important requirement now as it has become the language of the whole world for 
trade, tourism, health services and education. But in India there are still people who have difficulty 
in acquiring this language and with the insufficient knowledge of this language get into the higher 
studies. It is also contributing to the poor research quality being produced currently in India. At 
the school level, except the private schools who charge high amount of money for the education 
in English provided by them, the students learn through their respective vernacular language with 
the broken and distorted knowledge of English language. The scenario becomes problematic for 
such students when they aspire to pursue higher education which is only accessible through 
English language. After finishing the school, such students find themselves absorbed in this ‘ditch’ 
and have to suffer a lot as a consequence of that. This ‘ditch’ has ever been growing since 
Macaulay’s Minute, and still growing because a big portion of Indian population cannot afford to 
send their children in private schools and have to stay contented in a school where education is 
imparted through the vernacular languages. 
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